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Polynomials with all real roots 
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Chapter 




Polynomials in One Variable 



1.1 Notation and basic definitions 



We begin with the definitions of the various kinds of interlacing, the classes of 
polynomials of interest, and the properties of linear transformations that we 
shall be using. 

If f is a polynomial of degree n with all real roots, then define 

roots(f) = (ai,...,a n ) 

where ai < a2 < • • • < a n are the roots of f (x) = 0. If we write roots(f ) then 
we are assuming that f actually has all real roots. Given two polynomials f, g 
with roots(f) = (ai, ...,a n ) and roots(g) = (bi, ...,b m ) then we say that f 
and g interlace if these roots alternate. If all the roots are distinct then there are 
only four ways that this can happen: 



g < f Ql < bi < Q2 < b 2 < ■ ■ • < Q n < b n (1.1.1) 

g < f bi < ai < \>2 < <i2 < ■ ■ • < b n 

f <C g bi < ai < b 2 < a 2 < ■ ■ • < b n < a n 

f < g ai < bi < d2 < b2 < • • • < a n - 

We define g 3> f to mean that f <C g, and g > f to mean f < g. Notice that when 
we write f <C g we assume that both f and g have all real roots, which are 
necessarily distinct. If two polynomials interlace then their degrees differ by 
at most one. 

If the inequalities in il.l.H are relaxed, so that all inequalities "<" are re- 
placed by then we write g^Cf, g <f, f -C g and f < g respectively. We 
call interlacings with < or -C strict interfacings. It's easy to see that non-strict 
interlacings are limits of strict interlacings - see Remark l9.104l for a non ad hoc 
argument. 
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If f and g are two polynomials such that there is a polynomial h with f<h, 
9^h, (or h< g, h<f) then we say that f, g have a common interlacing. Unless 
stated otherwise, we assume that f and g have the same degree. 

The two interlacings f -C g and f < g have very similar properties. To elim- 
inate many special cases, we introduce the notation 

f < — g which means f -C g or f<g 

One way to remember this notation is to notice that f < — g means that 

• f and g interlace 

• The largest root belongs to f . 

• The arrow points to the polynomial with the largest root. 

There are several important subclasses of polynomials with all real roots. 

P = { all polynomials with all real roots} 
pP° s = { all polynomials with all real roots, all negative, 

and all coefficients are the same sign} 
P alt = { all polynomials with all real roots, all positive, 

and the coefficients alternate in sign} 

p± _ palt y ppos 

P 1 = { all polynomials with all real roots, all roots lying in a set 1} 

p± _ p(— oo,0] y p[0,oo) 

It is helpful to remember that that P pos has a superscript pos because when- 
ever the leading coefficient is positive all the coefficients are positive. If we 
want to only consider polynomials of degree n then we write P(n), P pos (n), 
and so on. 

We are particularly interested in properties of linear transformations of 
polynomials. Suppose that T is a linear transformation that maps polynomials 
to polynomials. 

• T preserves roots if whenever f has all real roots, so does Tf . Using the 
notation above, we can write this as T: P — ► P. 

• T preserves interlacing if whenever f and g interlace, so do Tf and Tg. 

• T preserves degree if the degrees of f and Tf are the same. 

• T preserves the sign of the leading coefficient if f and Tf have leading coeffi- 
cients of the same sign. 
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The quintessential interlacing theorem is a well known consequence of 
Rolle's Theorem. The proof given below analyzes the graph of f and f - see 
Remark l9.68l and Remark 1 15 .281 for proofs that do not involve any analysis of 
the graph. 

Theorem 1.1. If f is a polynomial with all real roots, and all these roots are distinct, 
then f <f . 

Proof. By Rolle's Theorem, there is a root of f between consecutive roots of f . 
If the degree of f is n, then there are n — 1 intervals between roots of f, so f ' 
has at least n — 1 roots. Since f ' has degree n — 1, it has exactly n — 1 roots. □ 

If f has all real roots, but all the roots of f are not distinct, then f and f ' have 
roots in common. In this case we have that f <f. 

The derivative operator Df = f is of fundamental importance. It is a linear 
transformation P — ► P, P Qlt — > P alt , and P pos — > P pos . We just observed 
that D preserves roots. We will see in Theorem 11.471 that it also preserves 
interlacing. 

We will occasionally make use of the stronger version of Rolle's theorem 
that states that there are an odd number of roots (counting multiplicity) of f 
between any two consecutive distinct roots of f . 

Example 1.2. If f is a polynomial with all real roots, then it is not necessarily 
true that some anti-derivative of f has all real zeros. Equivalently, the deriva- 
tive does not map P(n) onto P(n — 1). A quintic p(x) such as in Figure lPl has 
four critical points, so p' has four zeros. However, no horizontal line intersects 
p in more than three points, and so no integral of p' will have more than three 
roots. For instance, (x + 1) (x + 2) (x + 4) (x + 5) is not the derivative of a quintic 
with all real roots. 




Figure 1.1: A quintic with 4 critical points, and at most 3 roots 



We introduce a relation that is weaker than interlacing. See Section fl.lOl for 
more properties. 

Definition 1.3. Suppose that 



f(x] = (x-r 1 )---(x-T T J 

g( X ) = (X-Si)--- (x- S m ) 



n ^ r 2 ■ ■ ■ < r 

Si ^ S2 ■ ■ • ^ S 



n 



m 
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We say that g M f if m < n, and si ^ n for i = 1, . . . , m. 
In other words, each root of g is at most the corresponding root of f . Note 
that the degrees of f and g do not need to be equal. 

Common interlacing and -4 imply interlacing. The proof of the Lemma is 
immediate. 

Lemma 1.4. If 

2. f <g 

2. f and g have the same degree 

3. f and g have a common interlacing 
then f<q. 

Common interfacings can be split into two interlacings. 

Lemma 1.5. If f , g have the same degree and have a common interlacing then there 
are f ly f 2/ gi, g 2 eP such that 

f i < — gi f = fifz 

92 < — h g = gi92 

Proof. Choose h < g and h < f . There is one root of f and one root s^ of g 
between the ith and (t+ l)st roots of h. We may assume that the roots of f and 
g are distinct. The desired polynomials are defined by 

roots(fi) ={n | n < St} roots (gi) = {st | n < Si} 

roots(f 2 ) = {n | n > sj roots(g 2 ) ={s t | n > s t } 

□ 

Remark 1.6. Here are a few important properties about polynomials. See 11411 . 
First of all, the roots of a polynomial are continuous functions of the coeffi- 
cients. Consequently, if a sequence of polynomials {f n } converges to the poly- 
nomial f and all f n have all real roots, then so does f . In addition if there is a 
polynomial g such that g < — f n for all n then g < — f . We should also men- 
tion that roots of polynomials with real coefficients come in conjugate pairs. 
It follows that if a polynomial of degree n has n — 1 real roots, then it has all 
real roots. Lastly, if fg has all real roots, then so do f and g. 

1.2 Sign interlacing 

The easiest way to show that two polynomials with all real roots interlace is 
to show that one sign interlaces the other. The sign of a number c is defined as 
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[1 if c> 

sgn c = < — 1 if c < 
[o ifc = 

We say that f sign interlaces g if the sign of f alternates on the roots of g. 
More precisely, there is an e = ±1 so that if r is the i-th largest root of g then 
the sign of f (r) is e(— l) 1 . 

We usually assume that polynomials have positive leading coefficient. In 
this case, if f <C g then there is exactly one root of f that is larger than the 
largest root r of g. Since f has positive leading coefficient, it must be the case 
that f (r) is negative. Consequently, if g has degree n then the sign of f at the 
i-th largest root of g is (— l) 1+n+1 . 

It should be clear that if f and g interlace then f sign interlaces g, and g 
sign interlaces f . The converse is not always true, and is discussed in the first 
three lemmas. Figure [L2l shows two interlacing polynomials f <C g, where f is 
the dashed line, and g the solid line. The solid circles indicate the value of f at 
the zeros of g - notice that these circles alternate above and below the x-axis. 




Figure 1.2: Two interlacing polynomials 



Lemma 1.7. Suppose f, g are polynomials satisfying 

• f has all real roots, all distinct. 
. deg(f) =deg(g) +1 

• g sign interlaces f 
then f < g. 

Proof. Assume that f has n roots. Since the sign of g alternates between con- 
secutive roots of f, we find n — 1 roots of g, one between each pair of consec- 
utive roots of f . □ 
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If we knew that f sign interlaced g in Lemma 11.71 then it does not follow 
that f < g. For instance, take f = x 3 and g = (x — l)(x + 1), so that the roots 
of f and g are {0, 0, 0} and {— 1, 1}. The reason that g and f do not interlace is 
that although we have roots of f between roots of g, there aren't any roots of 
f outside of the roots of ( g ) . In order to find these roots, we must control the 
behaviors of f and g by specifying appropriate signs at the largest and smallest 
roots of g. 

Figure fOl shows graphically that if f (solid) sign interlaces g (dashed) then 
it is possible that f is not interlaced by g. 



Lemma 1.8. Suppose f, g are polynomials satisfying 

• f has all real roots, all distinct. 

• deg(f) = deg(g) - 1 

• g has positive leading coefficient. 

• g sign interlaces f. 

• Let a be the smallest root off, and (3 the largest root off. If either 

1. sgn f((3) = —1 or 

2. sgn f(a) = 1 

then g < f . 

Proof. Since g sign interlaces f there is a root of g between adjacent roots of 
f. This determines deg(f ) — 2 roots, so it remains to find where the last two 
roots are. If sgn(g((3)) = — 1 then since lim^oo sgn(g(x)) = 1 there are an 
odd number of roots (hence exactly one) to the right of (3 . Similarly there is a 
root to the left of a. See Figure 11.31 □ 

If f and g both have the same degree and they sign interlace, then either 
f<Cg or gCf. Again, we need extra information about the behavior on the 
extremes to determine which case holds. Figure fl~4l shows the two possibili- 
ties. 




Figure 1.3: Sign interlacing doesn't imply interlacing. 



Lemma 1.9. Suppose f, g are polynomials satisfying 
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• f has all real roots, all distinct. 

• g has positive leading coefficient. 

• deg(f) = deg(g) = n 

• g sign interlaces f. 

• Let a be the smallest root off and |3 the largest root. Then 

1. I/sgngO) = lffenf<g. 

2. Ifsgn g((3) = -1 theng^i. 

3. Ifsgn g(a) = -(-l) n f/zen g<f. 

4. Ifsgn g(a) = (-l) n then f<g. 

Proof. Assume that f has n roots. Since the sign of g alternates between con- 
secutive roots of f, we find n — 1 roots of g, one between each pair of consec- 
utive roots of f . Now g has an odd number of roots in each of these intervals, 
and there cannot be three roots of g in any of them, for then g would have at 
least n + 2 roots, so there is one more root of g, to either the left of the smallest 
root of f, or to the right of the largest root. Thus, we have either f -c g or g <C f . 

All four parts are similar, so we will only establish the first one. Assume 
that sgn g(|3) —1. Counting the number of alternations of sign of g gives 



x — > — oo 

m-L„„ n i -i "v tl — 1 



lim sgng(x) = (-1) 

O 

sgng(a) = (-l^sgn (3 = (-1) 

and hence there is a root of g in (— oo, a). This shows that f <C g. □ 

As a first consequence of these elementary but fundamental results, we 
choose 

Lemma 1.10. Ifi, g have positive leading coefficients and there is an h so that h < — 
f , g then for all positive a and ft the linear combination af + (3 g has all real roots and 
h< — af + (3g. 

Proof, f, g both sign interlace h, and since they have positive leading coeffi- 
cients, they are the same sign on all the roots of h. Consequently, ocf + |3 g has 
the same sign on a root of h as does f or g. The result follows from Lemmas ll.9l 
and [U □ 



CHAPTER 1. POLYNOMIALS IN ONE VARIABLE 



9 



Although the sum of two arbitrary polynomials with all real roots does not 
necessarily have all real roots, they might if one of them is sufficiently small. 

Lemma 1.11. Suppose f e P has degree n and all distinct roots. Ifg&P has degree 
n + 1 then f + eg g P for sufficiently small e. 

Proof. The roots of f + eg converge to the roots of f as e goes to zero. Since 
the n roots of f are all distinct, f + eg has n real roots for e sufficiently small. 
Since there is only one root unaccounted for, it must be real. □ 

The next well-known lemma is easy to see geometrically. 

Lemma 1.12. 

1. If there is an a such that f (x) Ice P for all |c| > a then f is linear. 

2. If there is an a such that f (x) Ice P for all c > a then f is quadratic with 
negative leading coefficient. 

3. If there is an a such that f(x) + c S P for all c < a then f z's quadratic with 
positive leading coefficient. 

Sign interlacing determines the intervals where sums of polynomials have 
roots. In general the roots lie in alternating intervals, but which set of inter- 
vals depends on the behavior at infinity. This is in turn determined by the 
sign of the leading coefficient of the sum. For instance Figure 11.51 shows two 
polynomials f (dashed) and g (solid). The intervals on the x-axis where f + ag 
has roots for positive a are solid; the intervals where they do not have a root 
(since both functions have the same sign) are dotted. 



g f 




Figure 1.5: Intervals containing roots of f + ag, a > 0. 



Lemma 1.13. Assume that polynomials f, g, h have positive leading coefficients. Iff 



f g 

g h 



and g interlace, and 
say that 

• Iff < g then g < h. 

• If f <c g then g < — h. 

• Iff > g then g — > h. 



< Ofor all x then g and h interlace. In more detail we can 
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• J/f >g then g>h. 

Proof. Assume that the degree of g is n. If r is a root of g then upon evaluating 
the determinant at r we find that f (r)h(r) < 0. Since f sign interlaces g, so also 
does h sign interlace g. This implies that the degree of h is at least n — 1. It is 
not possible that the degree of f h is greater than 2n, for then f h — g 2 will be 
positive for large values of x, since the leading coefficient of f h is positive. We 
now consider the four cases in turn. 

If f < g then the degree of f is n + 1. Since the degree of h is at least n — 1, 
and the degree of fh is at most 2n, the degree of h is exactly n — 1. Since h 
sign interlaces g, we can apply Lemma H. 71 to conclude that g < h. 

If f > g then the degree of f is n — 1, and the degree of h is either n + 1, 
n — 1 or n. The sign of f at the largest root r of g is positive, so the sign of h 
at r is negative. Since the leading coefficient of h is positive, there is a root of 
h in (r, oo), and so the degree of h can not be n — 1. Lemma [l .91 now implies 
that h < g if the degree of h is n + 1 . If the degree of h is n then h <C g . 

When f ^> g then the degree of h is n or n — 1. Again the sign of f on the 
largest root r of g is positive, so there is a root of h in (r, oo ) . Thus h has exactly 
n roots and g ^> h. 

If f <C g then a similar argument applies. □ 

Corollary 1.14. If f < g have positive leading coefficients and h = af + (bx + c)g 
then h < g if and only if the sign of the leading coefficient ofh is sign(a). 

Proof. Since f sign interlaces g it follows that h sign interlaces g. If (3 is the 
largest root of g then f ((3) < so the sign of h((3) is — sign(a). Use Lemma [L8l 

□ 

As a consequence we get a special case that will prove useful later. 

Corollary 1.15. If g G P[n) has positive leading coefficient and a £ (0,n) then 
xg' — ag G P. 

If a G (0,n) and g has p positive roots and m negative roots then xg' — ag has 
at least p — 1 positive roots and at least m — 1 negative roots. It is possible that 
xg' - ag £ P. 

Proof. We may assume that the leading coefficient of g is 1 . The leading coeffi- 
cient of xg' — ag is (n — a) . If a > n then we can apply Corollary ll.141 If a < 
then note that xg < g and xg <t xg' and therefore xg <J xg' + (— a)g. 

If a G (0,n) then xg' — ag sign interlaces the positive roots of g and the 
negative roots of g. This implies that it has at least p — 1 positive roots, and 
m — 1 negative roots, but it does not imply that the remaining two roots are 
real. For example, if g = x 2 — 1 and a = 1 then xg' — g = x 2 + 1 which has no 
real roots. □ 

The locations of the roots of an arbitrary linear combination can be nearly 
any value. For instance, 
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Lemma 1.16. If f, g interlace, are not multiples of one another, and (a,b) is any 
non-empty interval then there is an a such that 

• f+ageP. 

• f + ag has a root in (a, b). 

ffr) 

Proof. Choose an r e (a, b) that is not a root of g. If we define a = — then 

gW 

r is a root of f + ag. Now f + ag is not the zero polynomial since f and g are 
not multiples of one another, so f + ag is in P. □ 

However, in the case of the derivative, we can give precise information 
about the location of the roots of a linear combination. 

Lemma 1.17. Iff e P (Q ' b) (n) andot>0 then f + af ' e p( a - att < b >. 

Proof. The only question is the location of the smallest root. Assume f has 
positive leading coefficient. Since sgnf(a — an) = (— l) n it suffices to show 
that 

->-^(a-au) (1.2.1) 
a f 

since this implies sgn (f + af')(a — an) = (— l) n . Suppose that the roots of f 
are r\, . . .,r n . Then 

— (a -cm] = > ■ — 

f z — an — a + Tt 

Now since th > a it follows that an — a + Ti > an and so 

1 1 1 

— =n — >> 

a an z — an — a + 

□ 

The following lemma combines addition and subtraction, since f 2 — g 2 — 
(f + g)(f — g). It is not a coincidence that the terms come from the binomial 
expansion (f + g) 2 - the result will be generalized in Lemma r20.36[ 

Lemma 1.18. Suppose that f <C g, the leading coefficient off is Cf, and the leading 
coefficient of q is c g . 

If |Cf|>|c g | then f 2 - g 2 < 2fg 

If |Cf|<|c g | then f 2 - g 2 > 2fg 

If jcf| = |c g | then f 2 - g 2 > 2fg 

Proof. The proof is a standard sign interlacing argument. For example, if the 
degree is 3, then the diagram below shows where the roots of f + g (marked 
"+") and f - g (marked "-") are: 

g f — g f — g f 

In the third case this is all of the roots. In the first case there is a root on the 
far right, and in the second case the remaining root is on the far left. □ 
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The following convolution lemma is a non-obvious consequence of sign in- 
terlacing. It will be generalized to a wider class of polynomials in Lemma r3.16l 

Lemma 1.19. Choose qj ^ • • • < a n , bj < • • ■ < b n and define f = YI( X ~ a 0/ 
g = Y[[x — hi). The following polynomial has all real roots 



x-b n+1 _ 

Proof. Without loss of generality we may assume that the roots of f g are X\ < 
•" < i~2n- We wm show that the sign of h(r m ) only depends on m. More 
precisely we show that sgn(h(r m )) = (— l) m+1 if m ^ n, and (— l) m if m > n. 
This shows that there are 2n — 2 intervals between roots where the sign at the 
endpoints is positive and negative, so h has 2n — 2 roots, and hence is in P 
since its degree is 2n — 2. 

We may assume that v m — a.t. Almost all terms of h are zero when evalu- 
ated at r m : 

h(r m ) = ( • g • ] (a t ) 

We investigate each of the three factors in turn. The first factor is simply f ' ( a\ ), 
and the sign of this is (— l) 1+n . 

Next, since g(ai) = Y\j ( a i — bj ), the sign of g(aj.) depends on the number 
of bj's that are greater than cu. Since cu is the m-th largest root, there are 
exactly m—i roots of g that are smaller than a^, and hence there are n— (m— i) 
roots of g that are greater than at. Therefore, sgn(g(cu) = (— l) TV-m+ \ 

Finally, consider the sign e of at — b n+ i_i. The largest ) such that bj is 
less than at is j = m — i. Consequently, the sign e is negative whenever 
(n + 1 — i) > m — i, and so e = 1 if m n and — 1 if m > n. Combining these 
calculations shows that the sign of h(r m ) is (— l) t+n ( — l) n ~ m+1 -e which is as 
claimed. □ 

1.3 A quantitative version of sign interlacing 

If f, g are two polynomials of the same degree n that sign interlace, then we 
know the signs of g evaluated at the n roots of f . This is similar to interpola- 
tion, where we are given the values of g, not just the signs, at these roots and 
use this information to determine g. We can combine these two ideas to give 
a quantitative version of sign interlacing. 

Let f be a polynomial with distinct roots roots(f) = (ai,...,a n ). Then 
polynomials 

fM ... f(x) 
x — Qi ' x — a n 

are of degree n — 1 and form a basis for the polynomials of degree n — 1 . If 
we add any polynomial of degree n (e.g. f itself) then we get a basis for the 
polynomials of degree n. 
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Lemma 1.20. Assume that f is a polynomial of degree n, with positive leading co- 
efficient, and with roots {a\, a n }. Suppose that g is a polynomial with positive 
leading coefficient. 

1. Ifg has degree n — 1 and we write 

g(x) = ci + hc n (1.3.1) 

x — ai x — a n 

then f < g if and only if all C\ are positive. 

2. If g has degree n and we write 

g(x) = c f(x)+ci-^ + --- + Cn-^ (1.3.2) 

x — ai x — a n 

then c is positive, and 

• f -c g if and only if all C\ are positive. 

• f 3> g if and only if all c\ are negative. 

3. If g has degree n + 1 and we write 

g(x) = (dx + c)f (x) + ci + • • • + c n (1.3.3) 

x — ai x — a n 

then d z's positive, and g < f z/and on/y z/a/Z ct are negative. 

Proof. Evaluate g (in all three cases) at the roots of f : 

g(cti) = Ci(ai - ai) ■ ■ • (at - cu-iHai - a i+ i) ■ •• (at - a n ) 

so sgng(at) = sgn(ci)(— l) 1+n . In the first case we know that deg(g) = 
deg(f) — 1. If all the Ct have the same sign then g sign interlaces f, and the 
result follows from Lemma lL7l 

In the second case, if all Ct are positive, and deg(g) = deg(f), then g sign 
interlaces f . Since the sign of g at the largest root a n of f is the sign of the 
leading coefficient of f, and the leading coefficient of g is positive, the result 
follows from Lemma [1.91 

In the third CclSG, S CJ7T Cj(cLrx) — SQTl Cn and since the leading coefficient of g 
is positive, we find g < f by Lemma [1.81 

Conversely, if f < g, then g sign interlaces f, and so the signs of g(at) al- 
ternate. Thus, all Ci's have the same sign, which must be positive since the 
leading coefficient of g is positive. The converses of the second and third 
cases are similar, and omitted. □ 
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Remark 1.21. We can extend Lemma ["1.201 by noticing that we do not require 
the coefficients Ci to be constant. If Ci (x) is positive (or negative) on an interval 
containing the roots of f , then g sign interlaces f . 

One special case of (|1.3.1|l is when all the C\ are 1. In this case, g is exactly 
the derivative of f . 

If we relax the positivity and negativity conditions of the parameters Ci's 
in Lemma [1.201 then we still have interlacing. For example, when the degree 
of g is n — 1, then f < g if and only if all Ci are non-negative. This case where 
some of the coefficients Ci are zero follows by continuity. 

Corollary 1.22. If f < g have positive leading coefficients then 

xf — bg <. f for any positive b. In addition, xg + bf < g. Ifb is negative then it is 

possible that xf — bg and xg + bf are not in P. 

Proof. The first is immediate from Lemma [1.20| if we write 

g = Y. c i ■ m me second case we can write f = axg + 13 g — r where g < r. 

Since (xg + bf) + yg = (a + b)g + (b(3 + y)g — br the conclusion follows. □ 

We can also get interfacings such as xg + bf <J g if we restrict g to P p ° s . See 
Lemma [2. 151 
The equation 

(-l) l+n sgng(cn)=sgnct (1-3.4) 

shows that the signs of the Ci.'s encode the sign of g evaluated at the roots of f . 
We can easily determine the Cj/s in terms of g and f . For instance, if f < g then 

c i = g(a i )/f'(a i ). 

There is another condition on the signs of the coefficients in dl. 3.11 1 that 
implies that g has all real roots. 

Lemma 1.23. Suppose that the roots of the monk polynomial f are m < <i2 < • • • < 
a n . Ifc\, . . . Cn. in 111.3.11 has exactly one sign change then g(x) in ( |1.3.H has all real 
roots. If Ci, . . . , c n has exactly two sign changes, c\ > 0, and Y_ Ci < then g (x) 
has all real roots. 1 

Proof. If the c/s have exactly one sign change, then assume that the terms 
of the sequence Z\, . . . , c T are positive, and c r+ i, . . . , c n are negative. | |1.3. W 
shows that g sign interlaces f between a.\, . . . , a r giving r — 1 roots, and also 
between a r+ i . . . , a n giving n — r — 1 roots for a total of n — 2 roots. Since g 
has degree n — 1 it follows that g has all real roots. 

If there are exactly two sign changes then the above argument shows that 
there are at least n — 3 real roots. The Ci's are initially positive, then negative, 
and then positive again. The leading coefficient of g is c i which is negative, 
and since sgn g(a n ) = sgn c n > there is a root of g in the interval (a n , oo), 
which again implies that g has all real roots. □ 

1 The condition Y_ c i < is necessary. If f (x ] = (x + l){x + 2)(x + 3) then a quick computation 
shows that — ^ + ^ P, and Y_ <H = 1. 
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Common interlacing puts no restrictions on the signs of the coefficients. 

Lemma 1.24. Any sign pattern can be realized by a pair of polynomials with a com- 
mon interlacing. 

Proof. This means that if we are given any sequence of pluses and minuses 
then we can find two polynomials gi, g 2 with a common interlacing such that 
if 92 = I!( x - r t) and 

gi = bg 2 + )_a i — - 

then sign(ai) = e^. 

We prove this constructively; the initial cases are trivial. Assume we are 
given gi, g 2 as above, and choose cx > (3 to both be smaller than the smallest 
root of gig 2 . We will show that the sign sequence of 

fi = (x-a)gi, f 2 = (x-|3)g 2 



is 



+, £\, . . . , e n 

By construction fi and f 2 have a common interlacing. Now at = gi(ri)/g 2 (n). 
Thus the sequence of signs determined by f i, f 2 is 

fi(|3) fi(n) fi(r n ) 



Now 



f 2 (P)'f 2 (n)' 'f 2 (r 



fi(P) _ fB ^ 9i(Pj 
f£(0) _lP aJ g 2 ((3) 

fi(n) (n-a)gi(ri) f^n 

= (n - a) 



f£(n) (n-(3)g 2 (n) + g 2 (n) 11 'f^(n) 



Since (3 is smaller than all the roots of gi and g 2 , the signs of gi((3) and g 2 (|3) 
are opposite. Also r t — a is positive, so the signs are as claimed. 

If we choose a > |3 instead, we get a minus sign for the sign, so all possi- 
bilities can happen. □ 

The next result is a simple consequence of quantitative sign interlacing, 
and is useful for establishing many interlacing results. 

Lemma 1.25. Assume f, g, j,k e P have positive leading coefficients, and f < g, 
k<f <Cj. Define 

h = £if + « 2 j + £ 3 i<+qg 

where l\,li,h <**e linear, and q is at most quadratic. The interlacing h< f holds if 
these three conditions are true: 
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1. The leading coefficients ofi\, I2, £3 are positive. 

2. q and I2 are negative on the roots off. 

3. £3 is positive on the roots ofi. 

Proof. Assume that roots (f) = (qj, a n ) and write 

g = > on 

z — x — di 

i = |3f+V" Pi— — 
* — x — di 

k = yf+y" Yi 

* — X — di 

where at, (3i, |3, y are positive, and yi is negative. From the definition 
h = [k + P« 2 + Y* 3 )f + 5~ (Pi<2 + Yi«3 + a iq )^^. 

* — X — di 

The hypotheses guarantee that the leading coefficient of l\ + (3£2 + Y^3 is pos- 
itive, and that |3i£2 + Yvfe + otiq is negative on the roots of f. The result now 
follows from Lemma [1.201 □ 

We will see consequences of the theorem in § 1212.41 As a quick example, 

xf-f-f'eP if f g P (1.3.5) 

because i\ — x— 1, I2 — I2 — 0, q = — 1 satisfy conditions 1, 2, 3. The hypotheses 
on i\, I2, £3 can be weakened by only requiring that l\ + |3£2 + Y^3 has positive 
leading coefficient. 

If we assume that all roots of f are positive (or negative) we can get more 
specific interlacing results. These interlacing facts will be useful in showing 
that various linear transformations preserve interlacing. See § 1212.41 

If polynomials f, g are in the relation f < — g, then g behaves like a deriva- 
tive of f . 

Lemma 1.26 (Leibnitz Rule). Suppose that f, f %, g, gi are polynomials with positive 
leading coefficients, and with all real roots. Assume that f and g have no common 
roots. 

• Ifi < — fi and g < — gi then fig, f gi, f g < — f gi + f x g < — figi 

In particular, fgi + fig has all real roots. Conversely, if i, g, fj, gi are polynomials 
that satisfy fg < — fig + fgi then 

1. f,g,fi,gi e P 



CHAPTER 1. POLYNOMIALS IN ONE VARIABLE 



17 



Proof. We only consider the case where the left hand side is f g, the other cases 
are similar. Let f = a Oi^* ~ a i) / 9 = b — ^H) H we write 

fi(x) = cf[x) +Cj 1 hc n (1.3.6) 

x — ai x — a n 

gi(x) = dg(x) + d 1 -^-+--- + d m -^- (1.3.7) 
x — bi x-b m 

where all the c/s and d/s are positive and c, d are non-negative then 



fgi + fig = (c + d)f(x)g(x}+ 

fMg(x) f(x)g(x) f(x)g(x) f(x)g(x) 

ci 1 hc n hdi 1 hd m — (1.3.8) 

x — Qi x — a n x — bi x — b m 

and the result follows from Lemma [1.201 

Conversely, if fg < — fgj + fig then we first observe that f, g e P. We 
can write fj in terms of f as in J1.3.6L and gi in terms of g as in fll.3.71 1, but 
we do not know the signs of the Ci's and the dj/s. However, from dl-3.8l > we 
know all the coefficients are negative. Thus the Cj/s and di's are negative. 
The coefficients c, d are non-negative since all the polynomials have positive 
leading coefficients. Thus f < — fj and g < — gj. □ 

Corollary 1.27. Suppose that f < g have positive leading coefficients, and a, c are 
positive. 

If ad < be then (ax + b)f + (cx + d)g € P. 

Proof. Note that ad ^ be is equivalent to cx + d <f; ax + b. Apply Lemma ll.261 

□ 

Remark 1.28. The Lame differential equation is 

f(xhj" + g(xhj' + h(xhj =0 (1.3.9) 

Heine and Stieltjes proved that if g € P(p), and f <J g then there are exactly 
( n+ ^ _1 ) polynomials h(x) such that the equation has a solution y which is a 
polynomial of degree n. Moreover, both h. and y are in P. The polynomials h 
are called Van Vleck polynomials, and the corresponding solution is a Heine- 
Stieltjes polynomial. 

Although h and f do not interlace, we do have B151II that fy' < hy. To see 
this, note that y' <Ly" and f < g, so by Leibnitz we have 

fu'<fy" + gu' =-hy 

This also shows that if y € P then he P. 

Recall that a cone is a set in a vector space that is closed under non-negative 
linear combinations. Lemma 11.201 describes all polynomials that interlace a 
given polynomial. If we restrict ourselves to polynomials with positive lead- 
ing coefficients then this set of polynomials is a cone. This is also true for 
polynomials in several variables - see Chapter llllll.l8l 
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1.4 Elementary interlacings 

In this section we use an algebraic definition of f <C g to derive many funda- 
mental interlacings. The following is a simple consequence of Lemma [1.201 

Corollary 1.29. If f , g have positive leading coefficients then f < — g if and only 
there is a polynomial h with positive leading coefficient and non-negative a such that 
f < h and g = af + h. In addition, if g < — f then there is a positive a and an h with 
positive leading coefficient satisfying f < h such that g = af — h. If the interlacing 
f < — g is strict then f < h. 

If f < — g then for any a, (3 the linear combination af + |3 g interlaces f 
and g . The exact direction depends on a and |3 . The converse is also true and 
of fundamental importance - see Proposition 11.351 This result is useful, for it 
gives us a simple way to show that the sum of certain polynomials with all 
real roots has all real roots. 

Corollary 1.30. Suppose f < — g, both f and g are monk, and a, |3 are non-zero. 

af + |3g — > f if |3 and a + |3 have the same sign 

af + |3g < — f if |3 and a + (3 have opposite signs or a + |3 = 

af + |3 g < — g if a and a + (3 have the same sign 

af + |3g — >g if a and a + (3 have opposite signs or a + (3 = 

If f < g and a, |3 «a??£ f«e same szgTi 

f < — af + |3g < — g 
f — >af- (3g < — g 

Proof. Since f , g are monic we can write g = f + h as in Corollary 11.291 We 
prove the first two assertions; the remainder are similar. Substitution yields 
af + |3g = (a + |3)f + (3h. Since (a + (3) and |3 are the same sign, the desired 
interlacing follows since f — > f and h — > f . 

For the second one, write in terms ofg: af+ |3g = ( a + |3)g — ah. If a + (3 
and a have opposite signs then g — > f and h — > f . □ 

We have seen that we can add interlacings. This is a fundamental way of 
constructing new polynomials with all real roots. Here are some variations. 

Lemma 1.31. Suppose f , g, h have positive leading coefficients, and a, |3 are positive. 

1. Ifi < — g and h < — g then af + |3h < — g. 

2. Iff < — g and f < — h then f < — ag + |3h. 

3. Iff < — g < h then af — |3h < — g. 
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Proof. Immediate from the representations of Lemma [j.20l and Corollary II. 291 

□ 

An interlacing square is a collection of four polynomials f, g, h, k that inter- 
9 

lace as follows: f k . If we have such a square then there are interlac- 

h 

ings between certain products. It is interesting to note that although we do not 
have the interlacing f g < — gk nor f g < — fh it is true that the sum interlaces: 
f g < — gk + fh. 

g 

Lemma 1.32. If f , g, h, k have positive leading coefficients and f k is an 

h 

interlacing square then 

fh fg fk 

fk+gh fh+gk 

kg hk hg 

Proof. The interlacings involving fk + gh follow from Corollary II. 301 as do the 
right arrows from fh + gk. We now show that f g and gk + fh interlace. Let 
f and g have roots {ai, . . ., a n } and {bj ^ • • • ^ b n }. Interlacing follows from 
these evaluations at the roots of f and g: 

sgn(gk + fh)(aO = sgn g(a0k(a0 = 1 
sgn(gk + fh)(bi) = sgnf (bi)h(bi) = -1 



Since f < — g the sign of gk + fh at the largest root b n of f g is negative, and so 
f g < — gk + fh. The remaining case is similar. □ 

Remark 1.33. We can use the representation of Corollary ll.29l to derive a simple 
relationship between the coefficients of interlacing polynomials that will be 
generalized later. Suppose f<g have positive leading coefficients and f = 

a n x n + a n _ix n_1 H , g = b n x n + b n _ix n_1 + • • • . We can write g = ocf + r 

where oc = b n /a n , and the leading coefficient of r is b n _i — ^a n _i. As 
long as g is not a multiple of f we know that the leading coefficient of r is 
positive. Using the fact that a n is positive, we conclude that | bZ-l I > 
0. In Corollary 14 .3 1 1 we will show that all such determinants of consecutive 
coefficients are positive. 

Squares don't interlace. 
Lemma 1.34. Ifi, ggP, and f 2 and g 2 interlace then f is a constant midtiple of g. 
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Proof. If g has a root that is not a root of f then g 2 has two consecutive roots 
that are not separated by a root of f . Thus g divides f , and f divides g, so they 
are constant multiples. □ 

1.5 Linear combinations of polynomials 

When do all linear combinations of two polynomials with all real roots have 
all real roots? The answer is quite simple: they interlace. This important result 
is central to the generalizations of interlacing to polynomials in several vari- 
ables that is developed in Chapter|9l and of polynomials with complex roots. 
The following proposition has been (re)discovered many times, see |141l page 
210]. 

Proposition 1.35. Suppose relatively prime polynomials f, g have all real roots, all 
distinct, with deg(f ) ^ deg(g). Suppose that for all positive a, |3 the linear combi- 
nation od + |3 g has all real roots. Then either 

1. deg(f ) = deg(g) and there is a polynomial k such that k < f, k < g. 

2. deg(f ) = deg(g) + 1 and then there is a polynomial k such that k < g, k ^> f 
or k<f. 

3. deg(f) = deg(g) + 2 and there is a kso that f < k < g. 

If od + (3g has all real roots for all positive and negative <x, |3, then either 



4. deg(f ) = deg( g ) and either f <c g or g <C f . 

5. deg(f) = deg(g) + 1 and f < g. 

In addition, the converses of all the cases are valid. 

Proof. Assume that the roots of fa nd g are Qj < • • • ^ a n and bi ^ • • • ^ 
b m . These n + m points determine n + m + 1 open intervals, and f and g 
have constant sign on the interior of each of these intervals. Enumerate these 
intervals from the right, so that Ii is either (a n ,oo) or (b m , oo). On adjacent 
intervals, one of f , g has the same sign on both intervals, and one has different 
sign on the two intervals. 

If f , g have leading coefficients with the same sign then they have the same 
sign on Ii, and hence on all odd numbered intervals. Thus, the roots of af + 13 g 

lie in the even numbered intervals I2, 14, By the continuity of the roots as a 

function of a, (3, there are the same number of roots of af + (3 g in each interval. 
By choosing a = 1 and |3 small, we see that each interval has at most one 
root. Thus, since af + |3 g has n roots, the number of even numbered intervals, 
|_(n + m + 1)/2J , is at least n. This implies that n = morn = m + l. 
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If we assume that f , g have leading coefficients with opposite signs, then 
the roots lie in the odd intervals of which there are [(n. + ra + 2)/2J. Conse- 
quently, n — m, n = m. + lorn = m + 2. 

If both endpoints of an interval are roots of f (say) then g is the same sign 
on the closed interval. If we choose (3 sufficiently small, then we can force 
f + |3g to have the same sign on the interval, and hence there are no roots 
there. If we can choose (3 to be any sign, then choosing (3 large gives at least 
two roots of af + |3 g on the interval. Consequently, when a, (3 are unrestricted 
in sign, intervals have one endpoint a root of f and the other a root of g. This 
implies (4) and (5). 

If deg(f) = deg(g), and their leading coefficients have the same sign then 
the n roots of af + |3g lie in the n even intervals. By the above paragraph each 
interval has one endpoint a root of f and one a root of g. Choose k to have one 
root in each of the odd intervals. 

If the leading coefficients differ in sign, then the the roots lie in the n + 1 
odd intervals. Since Ii and Iin+i each have only one root of f, g, and every 
other interval has one root of f and one root of g, choose k to have one root in 
each even interval. 

If deg(f ) = deg(g) + 2 then there are n odd intervals and n — 1 even inter- 
vals, so all the roots are in the odd intervals. It follows that from the right we 
have a root of f, then a root of f and of g in some order, a root of f and of g in 
some order and so on until the leftmost root which is a root of f . We choose k 
to have roots in all the even intervals. 

The case deg(f) = deg(g) + 1 is similar and omitted. 

□ 

The only unfamiliar case is when the degree differs by two. The thick lines 
in Figure [L6] shows where the roots of f + |3 g are located in case f has degree 4 
and g degree 2, and (3 is positive. A similar discussion shows that if f — ag S P 
for all positive a, then f and g have a common interlacing. 




Figure 1.6: Roots of f + g where the degrees of f, g differ by 2 



Corollary 1.36. Assume that the monic polynomials i\, . . . f n all have degree n, and 
have all real roots. The following are equivalent: 

1. For all positive oc\,..., a n , ajfj + ■ ■ • + a n f n has all real roots. 
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2. f i, . . . , f n have a common interlacing. 

Proof. By choosing all but two of the a's sufficiently small, we find from 
Proposition [L35] that for all i ^ j, f^, f , has a common interlacing. It is easy to 
see that this implies that they all have a common interlacing. (See p. [80]) □ 

If we have three polynomials whose linear combinations always have 
roots, then the third is a linear combination of the other two. 

Corollary 1.37. Suppose that f, g, h are three distinct polynomials such that for all 
a, (3, y the polynomial af + |3g + yh has all real roots. Then two of them interlace, 
and the third is a linear combination of the other two. 

Otherwise stated, the largest vector space contained in P has degree 2. 

Proof. If we set one of a, |3,y to zero then by Proposition II .351 we know that 
any two of f, g,h interlace. After renaming, we may assume that f<q<h. 
Fix an a and consider the two polynomials k a = f + ah and g. Since every 
linear combination of k a and g has all real roots, we know that k a and g 
interlace. Moreover, ko^C g and k a ;|> g when a is large. There is an a for 
which the interlacing switches from <§; to at this value k a and g have 
the same roots. Thus, for this choice of a the linear combination f + ah is a 
multiple of g. □ 

Next we consider the case of linear combinations where each coefficient is 
a polynomial. Notice that the order of interlacing is reversed. 

Lemma 1.38. Suppose h, k are polynomials of the same degree with positive leading 
coefficients. If for every pair of polynomials f <c g with positive leading coefficients 
the polynomial f h+ gk has all real roots, then k <c h. If k and h have different degrees 
then k < — h. 

Proof. We may assume that k and h have no factors in common. We will only 
need to assume that the conclusion only holds for linear f, g. By continuity, 
fh + gk holds when f and g are multiples of one another. Choosing f = ax, 
g = bx shows that x(ah + bk) has all real roots for all positive a, b. From 
Proposition ll.35l we conclude that fi, k have a common interlacing. 

Next, choose f = a(x — r) and g = b(x — s) where r is larger than all roots 
of hk, and s is smaller than all the roots of hk. Since f <C g we know from 
the hypotheses that all positive linear combinations of fh and gk lie in P, and 
hence fh and gk have a common interlacing. Combined with the fact that h 
and k have a common interlacing, it follows that k <§; h. 

The proof of the second statement is similar and omitted. □ 

1.6 Basic topology of polynomials 

In this section we note a few facts about the topology of polynomials and of 
interlacing. The set V of all polynomials of degree n is a vector space of di- 
mension n+ 1, and is homeomorphic to R n+1 . The set P(n) of all polynomials 
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of degree n in P is a subset of V, and so has a topological structure. We will 
determine the boundary and interior of P(n). 

Since roots are continuous functions of the coefficients, if a polynomial f 
has all real roots and they are all distinct, then there is an open neighborhood 
of f in which all the polynomials have all real roots, all distinct. This is half of 
the next theorem: 

Theorem 1.39. 

• The interior ofP[n) consists of polynomials with all distinct roots. 

• The boundary ofP[n) consists of polynomials with repeated roots. 

Proof. Suppose that f has a repeated root, so that we can write f = (x — a) 2 g 
for some polynomial g in P. For any positive e the polynomial ( (x — a) 2 + e)g 
has two complex roots. Since e can be arbitrarily small, every neighborhood 
of f has a polynomial that is not in P. Thus, f is in the boundary of P(n). Since 
polynomials with distinct roots are in the interior of P(n), this establishes the 
theorem. □ 

P satisfies a closure property. 

Lemma 1.40. If f is a polynomial, and there are polynomials ft G P such that 
lim ft = f then f € P. 

Proof. Since the roots of a polynomial are continuous functions of the coeffi- 
cients, if f had a complex root then some ft would have a complex root as 
well. Thus, all roots of f are real, and f S P. □ 

If we allow sequences of polynomials whose degree can increase without 
bound then we can get analytic functions such as e x as the limits of polyno- 
mials in P. The proof of the lemma shows that such limits will never have 
complex roots. Such functions are considered in Chapter[5] 

We can use Proposition [L35] to show 

Corollary 1.41. Limits preserve interlacing. 

Suppose that i\, ii,... and gi, g2, . . . are sequences of polynomials with all real 
roots that converge to polynomials f and g respectively. Iff n and g n interlace for 
tl — 1, 2, . . . then f and g interlace. 

Proof. Since f n and g n interlace, f n + ag n e P for all a. Taking the limit, 
we see that f + ocg has all real roots for all a. We apply Proposition 11.351 to 
conclude that f and g interlace. □ 

The nature of interlacing can change under limits. For instance, as n — > oo 
the interlacing 



CI - -)x(l + -) <{x - l)(x + 1) 
n n 
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converges to 

x>(x-l)(x + l) 

However, if the degrees don't change then the direction is preserved. 

Lemma 1.42. Suppose that lim ft = f, lim gt = g are all polynomials of degree n. 
If fi^gi for all i then f -C g. 

Proof. Write gi — at ft + hi, where fi<ht, hi. has positive leading coefficient 
and oti > 0. Clearly the limits lim hi = h and lim <xi — a exist. Since g = af + h 
where f < h. and h has non-negative leading coefficient we have f<g. 

An alternative argument is to consider the location of the roots. □ 

1.7 Linear transformations that preserve interlacing 

In this section we establish some properties of a linear transformation that 
preserves interlacing. We begin by observing that if a linear transformation 
maps P to itself then it preserves interlacing. This is an important result, for 
it frees us from having to try to prove that a linear transformation preserves 
interlacing - all we need to do is to show that the linear transformation maps 
P to itself. 

Theorem 1.43. If a linear transformation T maps P to itself then T preserves inter- 
lacing. 

If we also assume that T preserves degree, then f < g implies Tf <: Tg. In addition, 
let h< k. 

1. If the leading coefficients o/T(x), T(x 2 ), T(x 3 ), ... all have the same sign, then 
Th<Tk. 

2. If the leading coefficients o/T(x), T(x 2 ), T(x 3 ), . . . have alternating signs, then 
Th>Tk. 

IfT maps P bijectively to P then all interlacings <|C above can be replaced with <C . 
More generally, iflisal — 1 open map then all <C and < can be replaced with <C 
and <. 

Proof. If f and g are in P then for all a the polynomial f + ccg lies in P. Since 
T maps P to itself, T(f + ag) = T(f ) + ocT(g) is in P. We now apply Proposi- 
tion ll.35l to conclude that Tf and Tg interlace. 

Now assume T preserves degree. We may assume that h and k have posi- 
tive leading coefficients. We write k = bh+m where b is non-negative, h^m, 
and m has positive leading coefficient. Since Tk = bTh + Tm and Th.<jTm, 
the results (1) and (2) follow from Corollary 1 1.29 1 

It remains to show that if T maps P onto itself and preserves degree then 
T preserves strict interlacing. T is an onto map from the set of polynomials of 
degree n to itself. Choose disjoint opens sets f e Oi, g e O2 of polynomials so 
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that he Oi and k 6 O2 implies h<§c k. Then T(Oi) and T(02) are disjoint open 
sets whose elements interlace, so Tf <c Tg. Note that this only used 1 — 1 and 
open, so the last assertion follows. □ 

Remark 1.44. The two conditions for the signs of the leading coefficients of the 
sequence T(x),T(x 2 ),T(x 3 ),. . . are the only possibilities. See Corollary 11.501 
below. 

Remark 1.45. If a transformation merely maps P pos to P, then T does not nec- 
essarily preserve interlacing. We will see (Lemma 18.84b that the linear trans- 
formation T(x n ) = (-l) n(n+1)/2 x n mapsP pos — ► P. However, 

T(x + l)=l-x has root (1) 

T(x + l) 2 = 1 - 2x - x 2 has roots (-2.4, .4) 
T(x + 1) 2 ^ T(x + 1) 

Corollary 1.46. If T is a linear transformation such that T(xf ) < T(f ) for all f e P 1 

(where I is an interval containing 0) and maps polynomials with positive leading 
coefficients to polynomials with positive leading coefficients then T defines a linear 
transformation P 1 — ► P that preserves interlacing. 

Proof. We first show that if h, (x — a)h, (x — |3)h e P 1 where |3 < a then 
T(x — a)h<f;T(x — |3)h. By hypothesis we see that T(x — a)h^Th, and so 
the conclusion follows from 

T(x - |3)h = T(x - oc)h + (cc - 6)T(h) 

Next, assume f g are in P 1 , let r and s be the largest roots of f and g, 
and define h = f/(x — r). Since we have the interlacings (x — r)h<C g and 
g<f; (x — s)h it follows from the remark above that we can find a chain of 
interlacing polynomials 

(x-r)h< • • • <g< • • • < (x- s)h 

where consecutive polynomials satisfy the conditions of the first paragraph. 
Thus 

T(x-r)h< • • • <T(g)< • • • <T(x- s)h 

Since (x — r)h^; (x — s)h it follows that T(x — r)hcT(x — s)h and hence the 
sequence of polynomials is mutually interlacing. In particular, T(f ) = T(x — 
a)h<T(g). ' ' □ 

The next result is fundamental. 

Theorem 1.47. Differentiation preserves interlacing. 
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Proof. The linear transformation T(f ) = f maps P to itself, and therefore pre- 
serves interlacing. The derivative does not map P(n) onto P(n — 1). (see 
Chapter 1717. 11} . However, it's easy to see that the derivative is an open map, 
so strict interlacing is preserved. □ 

The converse to Theorem 1 1 .471 does not hold 2 . 

The simplest linear transformations mapping P to itself are of the form 
x x i— > cijX 1 . Such linear transformations are well understood, see § 15115.21 Here 
we determine their image on P pos . 

Lemma 1.48. Suppose the linear transformation T(x l ) = a\x l maps P to itself. 
Then 

(1) All non-zero terms of the sequence Qo, ai, . . . are consecutive. 

(2) Either all non-zero cu have the same sign, or the signs alternate. 

(3) T maps P vos to either P pos or to P alt . 

Proof. If the non-zero en's aren't consecutive and there are at least two consec- 
utive zeros then there are r, s such that s— r > 2, a r ^ 0, a r+ i = • • • = a s _i = 0, 
a s 7^ 0. In this case T(x T (l + x) s ~ r ) = x r (a T + a s x s ~ r ), and the latter polyno- 
mial is not in P. If there is an isolated zero, then s — r = 2, and so a r + a r +2* 2 
has all real roots. This implies that a r and a r+ 2 have opposite signs. 

Next, T(x T (x 2 — 1)) = (a r+ 2 — a T x 2 )x r , so if a T and a r +2 are non-zero 
then they are the same sign. Consequently, there are no isolated zeros. Thus, 
the signs of the first two non-zero terms determine the signs of the rest of 
the terms, so we can conclude (1) and (2). If all the signs are the same then 
T(P pos ) C P pos , and if they alternate then T(P pos ) C P alt □ 

If a linear transformation maps P to itself, then the leading coefficients of 
the images of x 1 also determine a linear transformation that maps P to itself. 

Lemma 1.49. IfT:P — > P is a linear transformation that preserves degree, and 
the leading coefficient o/T(x l ) is C\, then the linear transformation S(x x ) — ctx 1 also 
maps P to itself. 

Proof. Let Tfx 1 ) = gi(x), and choose f = YJo aixL ^ If we substitute ax for 
x, and apply T, we find that Y_q a^g^x) G P. Next, substitute x/oc for x to 
find that Y_o c L i ^9i{ x / ^) € P. Since gt(x) is a polynomial of degree i we see 
that lim^^o ct l 9i(x/a) = Ctx 1 -, and so 

TL 

lim y diO^giCx/tx) = ^~ chcix 1 = S(f) e P. 

a— >0 ^ — * — 

i=0 

□ 



2 Iff = x(x-l)(x — 2) and g = (x — .5) 2 then f < g ' but it is not true that f < g. 
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Corollary 1.50. IfT: P — ► P is a linear transformation that preserves degree then 
the leading coefficients of T[x l ) either all have the same sign, or their signs alternate. 

Proof. Use Lemma ll.48l and Lemma lL"49l □ 



1.8 Linear combinations and determinants 

g 

Suppose that we have an interlacing square f k . There are close con- 

H 

nections between the determinant | £ £ | and the interlacings 
af + (3g < ah + (3k for all a, p € E. 

Lemma 1.51. Assume that f, g, h, k are polynomials with positive leading coeffi- 
cients whose degrees satisfy deg(f) = deg(g) + 1 = deg(h) + 1 = deg(k) + 2. If 
for all a, |3, not both zero, we have that af + |3g < ah + (3k then 

h 

2. | £ £ | < Ofor all x. 
Similar residts hold if we replace "<" by " <c ". 

Proof. We know that h <. k and f < g by Proposition ! 1.351 and the assumptions 
on degree. Setting a = or |3 = shows that f < h and g < k. If the determi- 
nant is zero then there is an r such that f(r)k(r) = g(r)h(r). 

There are two cases. First, assume that f (r) = 0. Since h and f have no 
roots in common, we know h(r) ^ 0. If we set a = k(r), (3 = — h(r) then for 
this choice we get the contradiction that 

(af + |3g)(r) = = (ah+ 0k)(r). 

The second case is f (r) ^ 0. The choices a = — g(r), (3 = f (r) yield another 
contradiction. 

If we evaluate the determinant at a root r of k we find h(r ) g (r ) ^ Thus k 
and h have no roots in common, and so h < k. Similarly f < g. 

Since the determinant is never zero, it always has the same sign. We deter- 
mine the sign of the determinant by evaluating it at the largest root r of f . The 
result is — g(r)h(r) which is negative since f < g and f < h. □ 

There is a converse. 

Lemma 1.52. Suppose f, g, h, k are polynomials in P with positive leading coeffi- 
cients. If they satisfy 



J. f<g 



CHAPTER 1. POLYNOMIALS IN ONE VARIABLE 



28 



2. h<k 

3. f<H 

4. I f , 9 I is neuer zero. 

I H k I 

ttew f + ag < h. + ode for all a. A similar result holds if < z's replaced by <c . 

Proof. Assumptions (1) and (2) imply that f + ag and h + ak have all real roots 
for all a and that the degree of f + ag is always one more than the degree 
of h + ak. Assumption (3) implies that the conclusion is true for a = 0. If 
(f + ag) (r) = (h+ ak) (r) then eliminating a shows that (fk — gh) (r) = 0. By 
assumption (4) the determinant is never zero, so as a varies f + ag and h + ak 
never have a root in common. Consequently, at all times they must satisfy 
f + ag < h. + ak. The case with <C is similar. □ 

Example 1.53. It is easy to construct examples of interlacing squares whose 
determinants are always negative. We will see more of these determinants in 
Chapter [8] If we apply Theorem 19 . 1 131 to the product 

f[x,y,z) = (l+x + u+z)(2 + x + 3u+z)(5 + x + 2u+2z) 

then we get the interlacing square 

(l+x)(2 + x)(5 + x) < 19 + 19x + 4x 2 
A A 
29 + 31x + 6x 2 < 36 + 16x 

and the determinant is always negative. If we apply the theorem to the deter- 



minant 



1+x+y+z 

2 + u+2z 1 

1 l+3y + z 



we get an interlacing square where all interlacings are <C : 



1 + x < 8 + 7x 
5 + 4x < 18 + 7x 



and the determinant arising from this square is always negative. 



Remark 1.54. If we differentiate the entries of the first square 




'3x 2 + 16x + 17 8x + 19 
12x + 31 16 



(1 +x)(2 + x)(5 + x) 19 + 19x + 4x 2 
29 + 31x + 6x 2 36 + 16x 



then we have that 
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1. |M| > for all x. 

2. |M'| > for all x. 

3. |M + aM'| > for all x and a > 0. 

The last condition suggests that |M| and |M'| "positively interlace" in some 
sense. Indeed, M and M' arise from polynomials f and •1^ that do interlace, 
as polynomials of three variables. The reason why we require a to be non- 
negative is that M — aM/ can have negative coefficients, and the result we are 
using requires that all the coefficients are positive. 

The following lemma arises naturally when considering the linear trans- 
formation f i ► f + af for polynomials with complex coefficients (Corol- 
lary |2a38|. 

Lemma 1.55. If f < g (or f <c g) have positive leading coefficients then 

f f , g, < 0. If f < g then the determinant is also negative except at common roots of 
f and g where it is zero. Thus, zff < g then f + af ' < g + ag' for all a G R. 



Proof. We give two proofs. Since f < g it follows from Corollary ll.29l that f + ag 
has all distinct roots for all a. Since differentiation preserves < we can apply 
Lemma [1.5 II to the interlacing f + ag < f ' + ag'. 

If we don't use Lemma [1.511 then we can use partial fractions. If f < g we 

write Q — Y. bi — - — where f (x) = a Yl( x ~ a 0/ an d all bt are positive. Now 



x — cu 



f g' - gf 
f 2 



X — Cli 



■L 



bi 



X — Qi 



< 



If x is not a root of f then 



f g 
f g' 



< 0. If x is a root of f then the determinant is 



equal to — f'(x)g(x) which is negative since f and g interlace f. 

If the interlacing is not strict, then write f = hfo and g = hgo where fo < go- 



Since 



f g 
f g' 



h 1 



fo go 

f o 9o 



the result follows from the first part. 

If f <C g then there is a positive a and polynomial h with positive leading 
coefficient such that g = af + h and f < h. The result now follows from the 
first case since 

f af+h 
f af' + h' 



f g 

f 9' 



f h 
f h' 



□ 



An immediate corollary (using f < f) is a well known property of deriva- 
tives called the Laguerre inequality. 

Corollary 1.56. Ifi has all real roots, all distinct, then | f f , p, | < 0. 
There is a converse to Lemma [l.55l 
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Lemma 1.57. If f e P, the degree of g is at feast one less than the degree the off, and 



f g 
f g' 



^ 0/or a// x then f and g interlace. 



Proof. It suffices to show that for all a the polynomial f + ag is in P. When 
a = we know that f e P. If there is an a such that f + ag ^ P then there is 
an a for which f + ag has a double root, r. In this case we have that 

= f(r) + ag(r) 
= f'(r) + ag'(r) 



so the determinant 



f g 
f g' 



is zero for this value of r. □ 



If all linear combinations of f and g have all real roots, then the corollary 
tells us what direction they interlace. 

Corollary 1.58. If the following three conditions hold then f <|C g: 

1. f, g have the same degree. 

2. f + ag e P for all a e R. 



3. 



f g 
f a' 



< 0/or «ZZ x. 



The next lemma allows us to conclude that two polynomials interlace from 
interlacing information, and knowledge of a determinant. 

Lemma 1.59. Suppose that f < — g, h < — k are polynomials with positive leading 
coefficients whose degrees satisfy deg(f ) ^ deg(h) and deg(g) ^ deg(k) where all 
"< — " are " <c " or all are "< " . The ' following are equivalent 

9 
h 

2. The determinant | £ £ | is negative at roots ofi ghk. 



Proof. If r is a root of g then the determinant is f (r)k(r) which is negative by 
the interlacing assumptions. The argument is similar for the roots of g, h, k. 

Conversely, if the determinant is negative at the i-th root r of g then we 
have that f (r)k(r) < 0. Since f sign interlaces g, so does k sign interlace g. 
Similarly we see that h sign interlaces f . The degree assumptions imply that 
these sign interlacings are actually interlacings. □ 

A determinant condition gives information about the signs of the leading 
coefficients. 

Lemma 1.60. Assume that f < — h < — k, and that there is a polynomial g such 
that | * £ | < 0. Then the signs of the leading coefficients ofi and k are the same. 
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Proof. Without loss of generality we may assume that the leading coefficient 
of h is positive. Suppose that r is the largest root of h, the leading coefficient of 
f is Cf , and the leading coefficient of k is Ck- Since the determinant is negative 
we know that f (r)k(r) is negative. Since f < — h the sign of f (r) is — sgn(cf ), 
and similarly the sign of k(r) is sgu(ck). This implies that Cf and Ck have the 
same sign. □ 



1.9 Polynomials from determinants 



We can construct polynomials from determinants. Although the constructions 
yield nothing new in one variable, they are an essential ingredient for the 
construction of polynomials in two variables. We emphasize the properties 
of principle submatrices that correspond to properties of f (x)/(x — rO where 

f(x)=Il(x-Tlc). 

We consider determinants of matrices of the form xl + C where C is a 
symmetric matrix and I is the identity matrix. More generally we consider 
xD + C where D is a positive definite matrix, but these are really no different. 
Since D has a positive definite square root, 

|xD + C| = ID" 1 ! |xl + D- 1/2 CD- 1/2 | 

and D~ 1/2 CD~ 1/2 is symmetric. 

The determinants xl + C| generalize the product representation of a poly- 
nomial. If f (x) = (x— rj) • • • (x— r n ) then f (x) = |xI + C| where C is the diagonal 
matrix whose diagonal is {— r l7 . . . , — r n }. 

One important well known fact is that |xl + C| has all real roots. Indeed, 
since C is symmetric, its eigenvalues are all real, and the roots of |xl + C| are 
the negatives of these eigenvalues. 

If r is a root of f then the analog of f(x)/(x — r) is a principle submatrix. 
If M is a d by d matrix, and a C {1,2, . . ., d} then M{a} is the (sub)matrix of 
M formed by the entries in the rows and columns indexed by a. We let M[i] 
denote the submatrix resulting from the deletion of the ith row and column of 
M. Such a submatrix is called a principle submatrix. 

The maximal factor f(x)/(x — n) interlaces f(x). The analogous statement 
is also true for principle submatrices. The following is well known, but the 
proof below is elementary. 

Theorem 1.61. If A is annbyn Hermitian matrix, and B is any principle submatrix 
of order n — 1 then their characteristic polynomials interlace. 



Proof. Choose a e K, partition A 



and consider the following 



equation that follows from linearity of the determinant: 



B-xI 


c 




B - xl 


c 




B - xl 


c 


c* 


d — x + a 




c* 


d — x 


+ 





a 



CHAPTER 1. POLYNOMIALS IN ONE VARIABLE 



32 



Since the matrix on the left hand side is the characteristic polynomial of a 
Hermitian matrix, |A — xl| + <x|B — xl| has all real roots for any ct, and hence 
the eigenvalues interlace. □ 

The derivative of f (x) = Y\i ( x — r n ) can be written as 



d , f(x) , , f(x) 
■f(x) = 



dx x — ri x — r n 

There is a similar formula for determinants. Suppose that C is a n by n matrix. 

— |xl + C| = |M[1]| + • • • + |M[n]| (1.9.1) 
dx 

The factors of a polynomial with all distinct roots have all distinct roots. 
The same holds for principle subma trices. This is immediate since the roots of 
the submatrix interlace the roots of the original matrix. 

Finally, we have some ways in which principle submatrices don't general- 
ize properties of maximal factors. 

Example 1.62. If the roots of a polynomial f(x) = nl^* - r i) are distinct, then 
the polynomials f (x)/(x — TjJ are linearly independent, and span the space of 
all polynomials of degree n— 1. The matrix ( * * ) has two distinct roots {1, — 1}, 
yet the two principle submatrices are equal. 

Example 1.63. If the roots of f (x) are ordered r\ < • • • < r n then the factors 
are mutually interlacing: 

f(x) f(x) f(x) 

< — < ••■ < 



X — Ti X — T2 x — r n 

Although the u principle submatrices have characteristic polynomials that in- 
terlace the characteristic polynomial of M, it is not the case that these subma- 
trices even have interlacing characteristic polynomials, much less mutually 
interlacing ones. They only have a common interlacing. 

We can interpret l|1.3.3|l in terms of determinants. This representation of 
g(x) will prove useful when we consider polynomials in two variables. Since 
the Ci in dl.3.3l > are negative the vector 



is real. We assume that f(x) is monic so that f(x) = f7(x — at), and define 
the diagonal matrix A whose diagonal entries are a\, . . . , a n . Let I be the n by 
n identity matrix, and |W| the determinant of a matrix W. Expanding by the 
first row shows that 
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X + C V 

v* xl - A 



X+C V — C l V~ C 2 

V - c i x — a! 
y/—C2 x — a 2 



(x + c)f(x) +CJ 



glxj 



f(x) 
x — ai 



v 7 ^ 





X- Q r 



f(x) 



X- Q r 



(1.9.2) 



Equation ^1.9.2t shows that g G P since g is the characteristic polynomial of 
the symmetric matrix ( v c t _ V A ) . 



1.10 A relation weaker than interlacing 

The relation ■< is weaker than interlacing, yet still has some of the properties 
of interlacing. In particular, it is sometimes preserved by differentiation, and 
there are inequalities for 2 by 2 determinants of coefficients. 
Consider some simple examples and properties. 

1. (x+ l) n + x n 

2. If h A g and g A f then h < f . 

3. If g -4 f and s ^ r then (x — s) g < (x — r) f. 

4. If g -4 f and heP then hg < hf. 

5. If g -4 f and f 6 P pos then g A x k f for any positive integer k. 

6. If f < g then g -4 f . 

7. If f <Cg then we generally do not have g 4 f since the smallest root 
belongs to f . 

g -4 f determines a chain of interfacings. 

Lemma 1.64. Ifg M f fejfe the same degree then there are hi, . . . , h^ such that 



g =hi > h 2 > 
g = hi -4 h.2 



> h k =f 
h k = f 



If f and g /zaue different degrees then we can write f = hk where g -4 h, and h /zas 
f/xe same degree as g. 
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Proof. Let g = n^t* — s an d f = Y\ii x ~ r 0- The following interlacings 
give the desired sequence of polynomials. 



/ m \ /m— 1 \ m 



For the second part, we take k to be the polynomial whose roots are the 
n — m largest roots of f . □ 

Differentiation does not preserve if the degrees are different. For exam- 
ple, if f = (x + l)(x + 2) and g = x(x + l)(x + 2), then f < g but f < g' fails. 
We do have 

Lemma 1.65. If g f /zax?e t/ze same degree then g' f '. 

Proof. The result follows by differentiating the chain of interlacings in the pre- 
vious lemma. □ 

Lemma 1.66. Suppose that g A f, f and g have all positive coefficients, and f = 
2 II l a i x i ,g = L 1 m b i x\ Tten 



bt bj 
en dj 



^ /or 1 ^ i < j n 



Proof. We first assume that f and g have the same degree. In order to show 
that btcij ^ bj at, we note that this is equivalent to ^ > ^f. Thus it suffices to 
take j = i + 1 since then 

hi > b i+ i bj 
Hi a i+ i aj 

Since ^ is preserved by differentiation, we can differentiate i times, and so 
assume that i = and j = 1. Write f = T\™(x + n), g = Y\Ti x + s where 
< Ti < si for 1 < i ^ m. Then 



bo bi 
a ai 



IT* UTiLTi- 



Now assume that the degrees are different. Note that H Vi — T. o^x 1 , 
p 2 = Y. Pi* 1 and P3 = H Yi* 1 then 



Pi |3j 
at a. 



> and 



Yi Yj 
Pi Pi 



>0 



at aj 
Pi Pi 



> 
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By Lemma H.64l it suffices to prove the lemma for g and (x + <x)g where a > 0. 
If g = 22 biX 1 then the determinant is 



abi + bi_i abj + b,_i 



bt bj 
bi-i bj_i 



bi b i+ i 
bi_j b t 



^ 0, but this is Newton's inequal- 

□ 



As above, it suffices to show that 
ity (Theorem |4.8l |. 

A matrix is TP2 is all 2 by 2 submatrices have non-negative determinant. 
Lemma 1.67. If fj A f 2 A ■ ■ ■ A f k are in p vos then the matrix of coefficients of 

h 



z's TP? 



Proof. Since transitivity of implies that f i -4 f j for i < ) the result follows 
from Lemma [1.66l □ 

Here's a typical corollary: 
Corollary 1.68. If q a f are in P pos fen 

9 



9 



3 bi b 2 

bo bi b 2 

b bj b 2 ••• 

ao ai a2 

ao ai a2 • • • 

ao ai a2 



is TP 2 



Proof. The rows are the coefficients of the sequence of polynomials that satisfy 
the relations 

g A xg A x 2 g A x 2 f A x 3 f A x 4 f 

The result now follows from Lemma [1.6 71 

□ 

Example 1.69. Here is a example that uses A to get an inequality for the lo- 
cation of the roots of the derivative. Suppose that f = OiM* — r k)- Since 
f A (x — Tk) k (x — r n ) n ~ k we differentiate to find that 



f ' A (x - r^- 1 (x - r n ) n - k - : (nx - kr r 
It follows that the kth root of f ' lies in the interval 

[i"k,r k + -(r n - Tic)]. 



(n-k)r k ) 
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1.11 Polytopes and polynomials 

A polynomial ao + cux + • • • + a n x n corresponds to the point (ao, . . . , a n ) in 
K n+1 . Thus, we can relate geometric properties to algebraic properties. In 
particular, we study two questions: 

1. When does a polytope CP in R n+1 have the property that all points in CP 
determine polynomials with all real roots? 

2. When do polytopes CP and CP' have the property that every polynomial 
determined by points in CP interlaces every polynomial determined by 
points in CP'? 

There are simple answers when the polytopes are products of intervals, and 
the proofs are based on sign interlacing. If we have vectors a = (at) and 
b = (bi) then we define 

poly(a) = ao + ajx + a2X 2 + CI3X 3 + CI4X 4 -)-••• 
poly(a,b) = a + bix + a 2 x 2 + b 3 x 3 + a 4 x 4 H 

Lemma 1.70. Suppose that < a ^ b, and that there exists an f satisfying the 
interlacing f <C po/y( a,b) and f -C po/y(b,a). Then, for all c satisfying a < c < b 
we have that f < poly(c). 

Proof. If the roots of f are r\ < ■ • ■ < r n , then poly (a, b) and poly (b, a) sign 
interlace f, and so the sign of either of them at Tt is (— l) n+t . The roots are 
negative since all the coefficients are positive, so the conclusion will follow 
from 

poly(a,b)(— a) < poly(c)(— a) < poly(b,a)(— a) for cO 
Expanded, this is 

ao — bi a + a 2 a 2 — b 3 a 3 + Q4 a 4 • • • < Co — Ci oc + C2 a 2 — C3 a 3 + C4 oc 4 • • • 

^ bo — ai a + b2 a 2 — Q3 a 3 + b4 a 4 • • • 

which follows from 

Q2i o? x ^ C21 a 21 < b2i a 2t 
— t>2i+i a ^ — C2i+i a ^ — a 2 i+i a 

Figure [LZl shows the configuration of poly(a,b), poly(b,a) and poly(c). 

□ 

Next, we show that we don't need to assume common interlacing. 

Lemma 1.71. Suppose that < a ^ b, and that both po/y(a,b) and poly[b, a) 

have all real roots. Then poly{a,b) and poly(b, a) have a common interlacing. 
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Figure 1.7: Graphs of interval polynomials 

Proof. We prove this by induction on the degree n. We assume that a ^ b. 
From the proof of Lemma 11.701 we know that the graph of poly(b, a) lies 
strictly above the graph of poly(a,b) for negative x. When n = 2 there are 
two possibilities for the graphs of poly(a,b) and poly (b, a) (Figure [T78|> 




Figure 1.8: The possibilities for a quadratic 

Both of the roots of one quadratic can not be to either the left or the right 
of the roots of the other quadratic, for then there is an intersection. Thus, they 
must be arranged as in the right hand picture, and there is clearly a common 
interlacing. 

We now consider the general case. The key point is that we can differenti- 
ate. If we define 

a' = (Q 1 ,2a 2 ,3a 3 , • ■ ■ ,na n _i) b' = (bi,2b 2 ,3a 3 , • • ■ ,na u _i) 

then we have the relations 



poly(a,b)' = poly(b',a') 



poly(b,a)'= poly(a',b'). 
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The inductive hypothesis implies that poly (a, b) ' and poly (b, a) ' have a com- 
mon interlacing. 

Suppose that poly(a,b) has roots r\ ■ ■ ■ < r n . Both poly(a,b) and 
poly (b, a) are positive on the intervals 

(T"n-2, T n -l), (Tn-4, T n -3), ■ • ■ 

and so the only place that poly (b, a) can have roots is in the intervals 

(Tn-l/tn), (r n -3/T n -2) 

We now show that none of these intervals can have more than two roots of 
poly (b, a). Figure ll.9l is an example of what can not happen. We observe the 




Figure 1.9: An impossible configuration 



simple property of polynomials g, h with a common interlacing: 

For any x € R the number of roots of g greater than x and the 
number of roots of h greater than x differ by at most one. 

We now see that Figure 11.91 is impossible, since the two largest roots of 
poly (a, b) ' are greater than all the roots of poly(b, a) '. Moreover, if there were 
two roots of poly(b, a) in the rightmost solid interval labeled Q, then there are 
5 roots of poly (b, a) ' to the right of the point p, but only 3 roots of poly (a, b) ' 
to the right. 

Continuing to the left, suppose up to some point that we find all intervals 
with poly(a,b) negative have two roots of poly(b,a). The next interval to the 
left can't have roots, for the same reason as above. Also, it can't have more 
than two roots, for then there would be too many roots of poly(b, a)'. □ 

Combining the last two lemmas we get 

Proposition 1.72. Suppose that < a < b, and that both po/y(a,b) and 
poly(b, a) have all real roots. Then, for all c satisfying a < c < b w e have that 
poly(c) e P. 
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Remark 1.73. If we drop the hypotheses that that the coefficients are positive, 
then there might be no common interlacing, and the conclusion fails. If we 
take a = (1,-2,1), b = (1,2,1) and c = (1,0,1) then a < c ^ b. However, 
there is no common interlacing, and 

poly (a, b) = l + 2x + x 2 eP 
poly(b,a) = l-2x + x 2 eP 
poly(c)=l + x 2 ^P 



Corollary 1.74. I/O^a^ b, po/y(a,b) and po/y(b,a) have all real roots then all 
points in the product of intervals 

a x b = (a , b ) x (a^bi) x • • • x (a n ,b n ) 

determine polynomials in P. 

Lemma 1.75. Suppose that < a < c ^ b and a' ^ c' ^ b'. If 

po/y(a,b)l f po/y(a',b') 

po/y(b,a)/ ^ \po/y(b',a') 

then poly(c) < po/y(c'). 

Proof. Two applications of Lemma [l . 701 show that 

poly(a,b)< poly(c') poly(b,a)< poly(c') 

The proof of the lemma holds if we replace ^ by so we conclude that 
poly(c)< poly(c')- □ 

The Lemma shows that all points in the box a x b determine polynomials 
that interlace all the polynomials corresponding to points in a' x b'. 

1.12 The Faa di Bruno problem 

The Faa di Bruno formula is an expression for the m'th derivative of a com- 
position. We conjecture that certain polynomials associated with this formula 
have all real roots. We are able to establish this in the first few cases, for e x , 
and for x d . If f and g are polynomials then the Faa di Bruno formula ll97l is 

d m 

-(f(g(x)) = 



dx 

m! k , ,. / g'lxl V ! / g'-'lx] \ I 9'""^ 



2_ bl! ... bm! t I, 



2! J \ m! 

where the sum is over all bj + 2b2 + • • ■ + mb m = n and \>\ H h b m = k. 
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m 



f(g(x)) 

1 f'(g(x))g'(x) 

2 f"(g(x))g'(x) 2 + f'(g(x))g"(x) 

3 f< 3 '(gW) g'(x) 3 +3f"(g(x)) g"(x)g'(x) + f '(g(x)) g (3) (x) 

4 f( 4 »(g(x))g'(x) 4 + 6g"(x)f( 3 '(g(x))g'(x) 2 + 

4f"(g(x)) g' 3 »(x) g'(x) + 3f "(g(x)) g"(x) 2 + f'(g(x)) g( 4 >(x) 



We can simplify these expressions by writing 

— (f(g(x))=^fl k '(g(x))A m , k (x) 

and defining 

Fm(x,y) =^/A m , k (x) 
Here are the first few values of F m (x,y) 



m F m (x,y) 

1 

1 g'(x)y 

2 g'(x) 2 y 2 + g"(x)y 

3 g'(x) 3 y 3 + 3g"(x)g'(x)y 2 + g (3) (x)y 

4 g'(x) 4 y 4 + 6g"(x)g'(x) 2 y 3 + [4g< 3 ' (x)g'(x) + 3g"(x) 2 ]y 2 + g' 4 ' (x) y 



Note that F m only depends on g, and not on f . If a and m are fixed the 
transformation g i— > F m (a,y) is not a linear transformation. It follows from 
[95 ] that F m satisfies the recurrence 

Fm+i(x,y) = — F m (x,y) + g'(x) y F m (x,y) (1.12.1) 
dx 

If we iterate this we get the simple formula 

F m = (D x + g'(x)y) m (l) 
We can recover , „ (f (g(x)) from F m as follows: 

^:(f(gM) = (Y_ D^A m , k( x)) (Z f(k) (9W)^) 

Note that the term on the left is F m (x, D y ) and the term on the right is the 
Taylor series of f (g(x) + y ). 
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Conjecture 1.76. Suppose that g £ P. For any ael and m = 0, 1, . . . , 

1 . F m (oc,y ) has all real roots. 

2. F m+ i(ac,y) < F m (a,y). 

We can verify this in the first few cases. 
m=l We need to show that g ' ( ot)y £ P, which is trivial. 
m=2 This is again trivial, since 

F 2 (a,y) =y(g"(a) + g(a) 2 y 

m=3 Factoring out y we have a quadratic whose discriminant is 

(3g"(a)g'(a)) 2 -4(g'(a)) 3 g"'(a) = g'(a) 2 [9g"(a) 2 - 4g'(a)g"'(a)] 

and this is non-negative by Newton's inequalities. 

It takes a bit more work to verify interlacing F 3 <. F 2 . Factoring out y we 
need to show that for all a 

(g') 3 y 2 + 3g"g'y + g"'<(g') 2 y + g" 

If we substitute the root of the right hand side into the left hand side and 
multiply by (g') 2 we get 

g'(g") 2 - 3(g")V + g"'(g') 2 = g'(g'g"' - 2(g") 2 ) 

By Newton's inequalities this has sign opposite from the leading coeffi- 
cient of F3, so F3 < F 2 . 

The conjecture holds for g(x) = e x . 

Lemma 1.77. If g = e x then Conjecture \1.76\ holds. 

Proof. The key fact, which follows from |97], is that F m has a simple form if 
g = e x . That is, 

Fm(x,y) = B m (e x y) 

where B m is the Bell polynomial. We know that B m (x) < B m _i(x), and there- 
fore for all a we have that B m (ax) <jB m _i(ocx). □ 

The conjecture also holds for x d . 

Lemma 1.78. Ifg — x d then Coniecture \1.76\ holds. 
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Proof. To prove the lemma it suffices to show that there are polynomials H n 6 
P satisfying 

x n F n (x,u) =H n (x d y) 
and H n < H n _i . For example, if d = 3 we compute using (1.12.1) that 



n 


x n F n (x,y) 


Hn(z] 





1 


1 


1 


x(3x 2 y) 


3z 


2 


x 2 (9y 2 x 4 + 6yx) 


9z 2 + 6z 


3 


x 3 (27y 3 x 6 + 54u 2 x 3 + 6y) 


27z 3 + 54z 2 + 6z 


4 


x 4 (81u 4 x 8 + 324u 3 x 5 + 180u 2 x 2 ) 


81z 4 + 324z 3 + 180z 2 



We define H m by the recursion 
Ho = l 

H m+1 =-mH m + dz(H m + Hj (1.12.2) 
We will prove that F m = H m (x d y ) by induction, and it holds for m = 0. 

x m+1 F m = x m+1 (F^ + dx^ 1 y F m ) 
= x m+1 F^ + dx d+m uF m 
H m+ i(x d u) = -mH m (x d u) + dx d y(H m (x d u) + H'(z)[x d u]) 
= -mH m (x d u) + dx d yH m (x d u) + xH^(x d y) 
= -mx m F m + dx d ux m F m +x(x m F m )' 
= dx d yx m F m +x m+1 F^ 

Since F m has all positive coefficients H m has all positive coefficients, and 
by induction H m G P pos . The recursion jl. 12.2b shows that H m < H m _i. □ 

1.13 Root Trajectories 

If we are given polynomials p (x, oc) where a is a parameter, then we can inves- 
tigate the movement of the roots of p(x, oC] as a varies. There is a large body 
of literature in the case that p(x, oc) is an orthogonal polynomial [50. 128 [. We 
limit ourselves to two simple cases. We first look at the effect of small pertur- 
bations of the roots of a polynomial on the roots of the derivative, and then 
consider the root trajectories of f + ag where f < — g. See Figure |20.1 1 for 
trajectories of f + it g in C. 

Lemma 1.79. Iff e P, and we increase the roots off then the roots off also increase. 
® 
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Proof. Without loss of generality we may assume that the roots of f are dis- 
tinct. If g is the polynomial resulting from increasing the roots of f by a suf- 
ficiently small amount, then g <C f • Consequently g' <C f, so each root of g' is 
larger than the corresponding root of f ' . □ 

Lemma 1.80. If f < — g have positive leading coefficients then the roots off + ag 
decrease as a increases. 

Proof. We'll give three proofs! The simplest proof requires results from Chap- 
terl3l(Corollary l3T8] l: if a > |3 then the matrix ( * ? ) preserves interlacing, and 
hence f + gg <C f + ag. 

Next, a proof that uses the product representation of f . We assume that f, g 
have no roots fn common. Since f < — g we can write 



g 



(3f + Tbi — 

± Y t 



where |3 ^ 0, the bi are non-negative, and the cu are the roots of f . A root r of 
f + ag = satisfies 



= f(r) ( 1 + a|3 + a V b — — 

V t r - a \ 



Since f (r) ^ 0, we remove the factor of f (r), and differentiate (remember r is 
a function of a) 



* — r — ai * — r — at 

i i 

and solving for r' shows that r' is negative 



da 



r a 



V ^— (r - at 



bt 



Finally, a conceptual proof. We assume that a is non-negative; the case 
where it is non-positive is similar. When a = the roots of f + ag are the roots 
of g. As a — > oo the roots go the roots of g. The roots are continuous functions 
of a. Since every real number r is the root of at most one equation f + ag = 
(choose a = — f(r)/g(r)) it follows that the roots start at roots of f for a = 0, 
decrease to the roots of g, and never back up. □ 



1.14 Recursive sequences of polynomials 

We give two different ways of constructing sequences of interlacing polyno- 
mials. The first is to use a recurrence; the second is the use of log concavity. 
The interlacing constructions of previous sections can be iterated to create in- 
finite sequences of interlacing polynomials. 
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Lemma 1.81. Suppose {at}, {bt}, {ct} are sequences where all a\ and d are positive, 
and the bi are unrestricted. Define a sequence of polynomials recursively by 

P-i =0 
Po = 1 

p n = (d n X + b n )p n _i -C n p n _2 for H ^ 1 

These polynomials form an infinite interlacing sequence: 

Pi >P2 >P3 > • ■ ■ 

Proof. We first note that P2 evaluated at the root of pi is — C2 which is nega- 
tive. This shows that P2 < Pi; the general case now follows by induction from 
Lemma [l.25l □ 

It is important to note that the sequences of Lemma 11.811 are exactly the 
sequences of orthogonal polynomials; see H1681 page 106]. These polynomials 
have the property that the degree of p n is n; we can also construct sequences 
of interlacing polynomials that all have the same degree. 

We can sometimes replace the constant c n by a quadratic: 

Lemma 1.82. Suppose that f n e P pos is a sequence of polynomials satisfying the 
recurrence 

fn+lM = £tvM fa(x) - qrt(x) f n -lM 

where 

1. l n is linear with positive leading coefficient. 

2. q n is quadratic with positive leading coefficient. 

3. q n is positive at the roots off n . 
then f n+1 <f n . 

Proof. This follows from Lemma [1.251 □ 

Here's an example where it is easy to verify the third condition. 

Corollary 1.83. Suppose that f n e pv° s f s a sequence of polynomials satisfying the 
recurrence 

fn+l(x) = (a n X + b n )f n (x) +x(c n X+ d n )f n _i(x) 

where c n < and d n ^ 0. Then f n+ i < f n . 

Proof. Since f n e pP° s its roots are negative, and when r < we have that 
r(rc n + d n ) < 0. Now apply the lemma. □ 
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Example 1.84. The Narayana polynomials N n are defined by the formula 

1 /n\ / n 



n \kj \k-l 

i=0 



x k . 



and satisfy the recurrence 



N n = — !— ( (2n - l)(x + 1) N n -i - (n - 2)(x - l) 2 N n -2 



n + 1 

It follows from Lemma lL82l that N n < N n -i- 

Example 1.85. We show that the sequence of commutators determined by the 
Narayana polynomials is an interlacing sequence. Let T(x n ) =N U , and define 

fn = [D,T]( X n )=N n -uNn-l 

The f n s satisfy the recurrence relation 

f - = (n + 1)^- 1)2 (n 2 -n-l + (n 2 -n + l)x) f ^ 

,n " 2m2 (x-D 2 fn- 2 



fn-l) 2 (n + l) 



Now f 3 = 1 + 3x, so using induction and the recurrence we conclude that 
f n (l) ^ 0. It follows from Lemma [L82l that 

■ • • < fn < fn-l < fn-2 • ■ ■ 

Example 1.86. In [116] they considered the recurrence relation 

Pn+l = 2x P n - (x 2 + (2n - l) 2 ) pn-l. 

This satisfies all the hypotheses of Lemma 11.821 so we conclude that all p n 
have all real roots, and consecutive ones interlace. 

A small modification to the recurrence in Lemma [l .811 determines a series 
of polynomials that are known as orthogonal Laurent polynomials .[46] 

Lemma 1.87. If all coefficients a n , h n , c n are positive, a n > c n , and 

P-i =0 
Po = 1 

Pn = (a n X + b n )Pn-l -CnXPn-2 fom~^\ 

then pn € P alt , and pi > P2 > P3 > • • • 
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Proof. We proceed by induction. The case p 2 < pi is easy, so assume that 
Pn-i < Pn-2- The hypotheses on the coefficients implies that the coefficients 
of all pk alternate in sign, so p n -i and p n -2 have all positive roots. Conse- 
quently 

d n xp n _i <p n -l «C n XPn-2 

and so 

Pn = d n xp n _i - C n Xp n -2 < Ptt-1 

Note that from f < g h we can not always conclude that f — h < g . We can do 
so here because the leading coefficient of a n xp n -i is greater than the leading 
coefficient of c n xp n -2 by the hypothesis that a n > c n . 

□ 

Remark 1.88. The reverse of a polynomial f (x) of degree n is x^f (1/x). The re- 
verse of a (Laurent) orthogonal polynomials also satisfies a simple recurrence. 
For orthogonal polynomials as in Lemma [l.81l the recurrence is 

p- V = (On + b n x) p^j - Cn X 2 p- V 2 

and for Laurent polynomials as in 

Lemma[L8Zl 

p- v = (On + b n x) V ™ t - C n Xp- V 2 

If we modify the signs of coefficients in Lemma ll.81l then we still know the 
location of the roots. See page|50]for more results. 

Lemma 1.89. Suppose {at}, {ct} are sequences of positive terms. Define a sequence 
of polynomials recursively by 

P-i =0 
Po = 1 

Pn = a n xp n -l + CnPn-2 far H ^ 1 

All the roots o/p n are purely imaginary. 

Proof. It is easy to see that p n only has terms of degree with the same par- 
ity as n. Consequently, if we define q n (x) = i n p n Ux) then q n has all real 
coefficients, and satisfies the recurrence 

q n = a n x q n -i - c n q n -2 

Thus q n has all real roots, and so p n has purely imaginary roots. □ 

Example 1.90. The d-dimensional cross polytope is 

d = {(xi, . . .,x d ) G E d | |xi| + • • • + |x d | < n} 

The number of integer points pa in Od having exactly k non-zero entries is 
2 k ( d ) (V) since there are 
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• 2 k choices of sign for the non-zero entries. 

• (£) choices for the non-zero coordinates 

• (£) positive integer solutions to x\ + ■ ■ ■ x^ ^ n 

and therefore pa = 2^k=o 2 k (£) (£) ■ Pd is known as the Ehrhart polynomial of 

d - If we define q d = Lk=o 2k (k) v 2 ) tnen we can verif y tnat Id satisfies 
the recurrence relation 

2 d-1 

qd = ^^qd-i + — j — id-2 

d d 

and thus qd has all imaginary roots. It follows from Lemma [1.891 that the real 
part of all the roots of pd is —1/2. (See [26|.) 

Example 1.91. Assume that fo has all positive roots and define a sequence by 
the recurrence f n +i = f -n. + x ^' n - All these polynomials have the same degree, 
all real roots, and (|1.14.1|l is satisfied. (See Lemma B.llD 

We now briefly discuss log-concavity. 

A function f is strictly concave if for every choice of distinct a, b in the do- 
main of f we have 

f(a)+f(b) ,/a + b\ 



We say that f is strictly log concave if its logarithm is strictly concave. This is 
equivalent to 

f(a)f(b)<f(^) 2 . 

If we restrict f to integer values (i.e., a sequence), then at the points 
i — 1, i, i + 1 a strictly log concave function g satisfies 

g(i-l)g(i+l)<g(i) 2 . 

A sequence of functions fi, f2, • ■ • is strictly log concave if the sequence deter- 
mined by evaluating at any point is strictly log concave. This is equivalent 
to 

Forallx, f n _i(x)f n+ i(x) < f n (x) 2 . 
We can extend Lemma [1.1 31 to chains of interlacings. 

Corollary 1.92. Assume that . . is a sequence of polynomials with positive 

leading coefficients satisfying 

1. The sequence is strictly log concave. 

2. The leading coefficients have the same sign, or alternating signs. 
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then 

fi <— h <— h <— U < • (1.14.1) 

The property of log-concavity is not much stronger than interlacing. 

Lemma 1.93. Suppose f, g, h are polynomials of the same degree with positive lead- 
ing coefficients and f -C g. Then, g <C h if and on/y zf i/zere zs a positive constant c 
swc/z that f, g, ch zs strictly log-concave. 

Proof. If f, g, ch is strictly log-concave then g -C h by Corollary 11.921 Con- 
versely, if g <C h then there is a c such that f, g, ch is log-concave if and only 
if fh/g 2 is bounded above. Since f, g, h have the same degree, the limit of 
fh/g 2 is finite as x goes to ±oo. Consequently, it suffices to show that fh/g 2 
is bounded above at each root of g. Now g 2 is positive near a root a of g, 
so in order for fh/g 2 to be bounded above near a it must be the case that 
f (a)H(a) < 0. If the roots of g are {ai < ■ • ■ < a n }, then sgnf (at) = (-l) n+i , 
and sgnh(at) = (— 1) TV ~ K+1 / so sgnf (ai)h(at) = — 1. We conclude that fh/g 2 
is bounded above. □ 



1.15 The positive and negative parts 

We consider properties of the positive and negative parts of a polynomial, and 
apply the properties to a recurrence. 

If f e P, then f neg is the monic polynomial whose roots are the negative 
roots of f, and f pos the monic polynomial whose roots are the positive roots 
of f . For example, if we know that f <C g then exactly one of these possibilities 
holds: 

f neg <g neg & f o S<<g pos 

f neg > g ne9 & f pos < g pos 



Similarly, if we know that f < g then there are exactly two possibilities: 

f neg >g ne9 &f pos <g pos 

f neg <g neg & f pos < g pos 



Lemma 1.94. The notation ^> < describes the interlacing of the positive and neg- 
ative parts of a pair of polynomials. If the polynomials are f , g, then this example 
means that f neg g neg and f pos < g pos . An arrow between boxes means the inter- 
lacing off, g is described in the first box, and the interlacing of f + xg, f is described 
in the target box. Then, we have 



> < 



< > 



< < 



> < 



< > 
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Proof. There are six assertions to be verified. We describe in detail how to 
prove one of them, namely 



< > 




< < 





The diagram to be proved is the following statement: 

f neg <g neg & f pos < g pos ^ j^neg < f neg & h pos >f pos 

where h = f + xg. We use a sign interlacing argument, which we illustrate 
with an example where f has two positive and three negative roots. These 
assumptions imply the arrangement of the roots in the diagram below. We 
can determine the sign of g and f + xg at the roots of f; they are given in the 
second and third lines of the diagram. There is a root in the leftmost segment 
since f and f + xg have asymptotes of opposite sign as x — * — oo. It follows 
that there is a root of f + xg in every segment of the first line of the diagram. 
This tells us where all the roots of f + xg are, and implies the conclusion. 

— — f — 9 — f — 9 — f — 9 — f — 9 — f 

+ - + + -+ sign of g 

- + - + - + sign of f + xg 

□ 

We can apply this result to a recurrence. 

Corollary 1.95. Consider a sequence satisfying the recursion pk = Pk-i + xpk-2- 
If {PI/P2} satisfy any of the six interlacing conditions of Lemma \1.94\ then all pk are 
in P. 

The recurrence q n = xq n _i + q n _2 looks like the recurrence for orthogonal 
polynomials, but the sign is wrong. We show that if qo = 1, qi = x then the 
roots are all on the imaginary axis, and the roots of consecutive terms interlace. 
(See Lemma H~89l ) 

By induction we see that q2 n only has even powers of x, and q2n+i only 
has odd powers. Write 

CfcnM = P2n(x 2 ) q 2n+ l[x) = Xp 2 n+l(x 2 ). 

Here are the first few values: 



k 


qk 


Pk 





1 


1 


1 


X 


1 


2 


1+x 2 


1+x 


3 


2x + x 3 


2 + x 


4 


1 + 3x 2 + x 4 


1 +3x + 



The recurrence for q translates into two recurrences for p: 

P2k+lM = P2k(x) + P2k-lM 
p 2k (x) = xp 2k -l(x) +P2k-2M 
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and as matrix equations these are 

A A / P2k A = (V2k+i\ 
[l Oj \V2^J \ P2k J 

A A A>2k+l\ = (V2k+2\ 
\l OJ \ P 2k J \V2k+lJ 

We claim that we have the interlacings 

P2k < P2k+1 > P2k+2 (1.15.1) 

The first is trivial since P2k < P2k-i- The second follows from Lemma [1.201 
In particular, all p^ are in P pos . 

Now q2n(x) = P2n(x 2 ), so all roots of q2n lie on the imaginary axis. The 
roots of q2n+i also lie on the imaginary axis, and include 0. The interlacings 
111.15. Il l imply that the roots of the interlace. 

If we had that q n = xq n _i + a n q n -2 where a n is positive then the same 
argument applies, and the roots interlace on the imaginary axis. 

1.16 Polynomials without all real roots 

We have seen many ways of showing that polynomials have all real roots; here 
we note that certain polynomials do not have all real roots. 

1. p n = l-|-x + 2 ^ + - • • + 2 £r does not have all real roots if n > 1. Sincep n = 
p n _i, if p n had all real roots, so would p n -i- The quadratic formula 
shows that P2 does not have all real roots, so Pn. does not either. 

2. q 2 n = 1 + y + h j£cy and q 2n +i = x + ^ + h ( 2 n + + i)! do not have 

all real roots for n > 0. Again, note that q2 does not have all real roots, 
and q n = q n _i. 

3. p n = l!x + 2!x 2 H h nix 11 does not have all real roots. If it did then its 

(n — 2)th derivative would have two real roots, but 

(A) n " Pn .(n-l)!(n-2) ! (^l T+ x + ^) 

and this polynomial has no real roots. A more conceptual proof is that 
the exponential map (see Chapter l7l7.111 takes p n to 1 + x + x 2 + • • • + x n . 
This polynomial isn't in P, yet the exponential map sends P to itself. 

4. We can use Lemma [1.551 to show that there are no real roots. For exam- 
ple, 

If f and g interlace and have no roots in common then the 
polynomial f g' — f g has no real roots. 

If a monic polynomial does not have any real roots then it is positive for all 
real values. Such polynomials are often stable - see Chapter |2T] 



Chapter 




Polynomials with all positive 
coefficients 

In this chapter we consider p pos , the collection of all polynomials that have 
all negative roots. If the leading coefficient is positive then all coefficients are 
positive. We introduce an interlacing relation ~ for such polynomials that is 
weaker than interlacing for P, but is the natural definition for P pos . We call 
this relation positive interlacing. 

2.1 Basic Properties 

We establish some simple properties of P pos . 
Lemma 2.1. Suppose that f g P pos . 

1. If gin P has all positive coefficients then g e p pos . 

2. Ifg G P and f < g then g G P pos . 

3. If g[x) e P then there is a positive a such that g(x + a) G P pos . 

4. f(x) G P pos ijff f(-x) G P alt . 

5. f (— x 2 ) has all real roots if and only iff [x] G P pos . 

6. f (x 2 ) ftfls aZZ real roots if and only iff G P alt . 

Proof. If f G P has all positive coefficients, and r is positive, then f (r) is a sum 
of positive terms, and hence r is not a root of f . In addition, is not a root 
since f (0) is the constant term which is positive by assumption. 

If f <g, the roots of f are negative, and are interlaced by the roots of g, so 
the roots of g are also all negative. Thus, g G P pos . For the next part, take a to 
be larger than the absolute value of the smallest root of f . 
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The roots of f (— x 2 ) are the square roots of the negatives of the roots of f (x), 
so the roots of f (x) are all negative. The case that f G P alt is similar. 

□ 

The next result constructs a polynomial in P from two interlacing polyno- 
mials in P pos . See Theorem EMI for a converse. 

Lemma 2.2. Iff < g, f, g e P pos , then 

1. f(-x 2 ) <xg(-x 2 ). 

2. f(-x 2 ) +xg(-x 2 ) e P. 

Proof. If the roots of f (x) are (— r 2 , . . . , — r^) then the roots of the new poly- 
nomial f (— x 2 ) are (±Ti, . . . , ±r n ). Consequently f (— x 2 ) <C xg(— x 2 ), which im- 
plies the lemma. □ 

Here's a different way to tell if a polynomial is in P* . 

Lemma 2.3. Iff and g have no factors in common, g(0) ^ 0, and f interlaces both g 
and xg then g is in P alt or p pos . 

Proof. If we remove a zero from the roots of xg then there will be two consec- 
utive roots of f with no root of g in between unless is the largest or smallest 
of the roots of xg. □ 

Recall (Corollary [T30| that if T: P — > P then the leading coefficients are 
all the same sign, or alternate in sign. If T: P pos — > P then there are more 
complicated sign patterns. See Question |3 

Lemma 2.4. If either T: P pos — ► P pos or T: P alt — ► P alt , T preserves degree, 
and T(l) is positive then the leading coefficient of T(x n ) is positive. 

I/T: P pos — ► P alt or T: P alt — >P v ° s ,andT preserves degree then the signs 
of the leading coefficients o/T(x n ) alternate. 

Proof. If T(x n ) = Y. Ci/aX 1 then for any a 

T( (x + a) n ) = T(x n ) + naTfx^ 1 ) + • • • 

= C n<n X n + (C n _i, n + naCn-i^-ilx 1 ^ 1 H 

If T: P pos — ► P pos then for any cc> the polynomial (x + a) n is in P pos , and 
so T((x + a) n ) is also in P pos . It follows that for any positive a both c n ,n and 
Cn.-i,n + n.ac n _i /n _i have the same sign. Choosing a sufficiently large shows 
that the leading coefficients c n;Tl and c n _i ;Tl _i have the same sign. 

If T: P alt — ► P alt then we choose oc < and the argument is similar. For 
the last part replace x by — x. □ 

If all the roots are positive then the coefficients alternate. Here's another 
alternation result. 
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Lemma 2.5. If a polynomial has all real roots and all the coefficients of odd degree are 
zero then the signs of the non-zero coefficients alternate. 

Proof. If f (x) is such a polynomial then the non-zero terms all have even de- 
gree, so f (x) = f (— x). Thus, the roots of f come in pairs n, —r\, which implies 

fM=n(x 2 -r?) 

i 

and therefore the non-zero coefficients alternate in sign. □ 

There is an interesting condition that guarantees that the signs of the coef- 
ficients alternate in pairs. 

Lemma 2.6. iff(x) and f (— x) interlace then the sign pattern of the coefficients off 
is ••• + + -- + + . 

Proof. We note that if 

f (x) = a + a x x + a 2 x 2 H 

then 

f (x) + f (-x) = 2(a + a 2 x 2 + •••)= 2g(x 2 ) 

Since the left hand side is in P it follows that g(x) has all positive roots. Simi- 
larly 

f (x) - f(-x) = 2x(ai + a 3 x 2 H ) = 2xh(x 2 ) 

implies h has all positive roots. Since g and h have all positive roots, their 

coefficients alternate, and so the coefficients of f have pattern ■ ■ ■+- 1 !-+■■■ 

since it is the intertwining of two alternating sequences. □ 

2.2 Interlacing in P pos 

Our definition of interlacing for P requires that all linear combinations of a 
pair of interlacing polynomials lie in P. Such a definition can not work if 
we replace P by Y v ° s , because if we choose a negative coefficient then the 
resulting polynomial could have both positive and negative coefficients. This 
motivates our definition: 

Definition 2.7. If f, g e P and deg(f ) > deg(g) then we say that f ~ g if and 
only if f + ag is in P for all positive a. We say that f ~ g if there are open 
neighborhoods Of, g of f and g such that for f G Of and g D e g we have 

fo ~ go- 

We write f <C + g if there is a positive |3 and h. e P with positive leading 
coefficient such that f ~ h and g = |3f + H. 
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It is clear from the definition that if f , g £ P pos satisfy f < g then f ~ g. We 
can express ~ in terms of < . 

Lemma 2.8. Suppose that f, g £ P have positive leading coefficients. The following 
are equivalent 

1. fig 

2. There isk £ P such that k<c f and k < g. That is, f and g have a common 
interlacing. 

Proof. If there is such a polynomial k then Lemma [1.31| implies that all positive 
linear combinations are in P pos . The converse follows from Proposition 1 1 .351 



Linear transformations that map P pos to itself preserve ~. The proof is the 
same as in Theorem ll.431 

Lemma 2.9. Suppose that T is a linear transformation that maps P v ° s to itself, 
and maps polynomials with all positive coefficients to polynomials with all positive 
coefficients. Then f ~ g implies that Tf ~ Tg. 

Unlike ^ , it is not true that f ~ g, h implies that g + Ke P. For example, if 

f = (x + 2)(x + 6)(x + 7) g = (x + l)(x + 3) h=(x + 4)(x + 5) 

_|_ _|_ 

then f ~ g, h but g + h has imaginary roots . In addition, even though f ~ h, 
f and f + h do not interlace. 

Next, we have a simple property of the derivative that will be generalized 
to homogeneous polynomials in Corollary l4.31 and to polynomials in two vari- 
ables in Corollary l9.93l 

Corollary 2.10. Iff<Q both have positive leading coefficients then 
f ~ — g'. Equivalently, f — ag' £ P for a > 0. 

Proof. Apply Lemma [l.31l to the inter lacings f < g < g'. □ 

Lemma 2.11. IffeP then f <+ xf. Ifi £ P alt then f < xf, and iff eP os then 
xf'<f. 

Proof. Since the leading coefficient of xf ' is positive it suffices to show that 
f + ctxf £ P for all positive a. So write f = n(x — T i) an d let g = xf. Then 



□ 




] But see p. [6391 
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This shows that f + axf interlaces xf for positive a, and hence is in P. 

The second part follows since f < f ', and is either greater or less than all 



2.3 Linear combinations in P pos 

In Proposition 11.351 we found that if af + (3 g have all real roots for all non- 
negative a, (3 then f and g have a common interlacing. The assumptions were 
not strong enough to conclude that f and g interlace. The next result gives 
assumptions about non-constant combinations that lead to interlacing. 

Lemma 2.12. Suppose that polynomials f, g are in p pos and have positive leading 
coefficients. In addition, assume that for all non-negative a, |3, y the polynomial ( ax+ 
(3)f + yg has all real roots. Then g < — f. 

Proof. If we apply Proposition II .351 we see that for all positive |3 the polyno- 
mials (x + |3)f and g have a common interlacing. If we choose —(3 smaller 
than the smallest root of f and g then since (x + |3)f and g have a common 
interlacing, it follows that the smallest root of g is less than the smallest root 
of f. If the combined roots of f and g have two consecutive roots of f (or g) 
then upon choosing — 13 in between these roots we find that there is no com- 
mon interlacing. It is possible to make these choices with positive |3's since all 
roots of f and g are negative. □ 

Lemma 2.13. If f + ccg and xf + ag are in p v ° s for all positive a then g < — f. 
Equivalently, iff ~ g and xf ~ g where f, g e P pos then g < — f. 

Proof. The only was that common interlacing of f and g, and of xf and g, can 
occur is if f and g interlace. □ 

It is not easy for linear combinations of polynomials to remain in P alt or 



Lemma 2.14. Suppose that f,g have the same degree, and that f+ag € P a t uP pos 
for all oc e M. Then at least one of the following is true 

1. g is a constant multiple off. 

2. There are constants a, b, c, d such that f = ( ax + b)x T , 
g = (cx + d)x r . 

Proof. We may assume that the degree is greater than 1. If it is not the case that 
g is a constant multiple of f then by Lemma 11.161 we can find a, |3 such that 
f + ag has a positive root, and f + (3 g has a negative root. Since by hypothesis 




the roots of f . 



□ 



ppos 
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we have that f + ag G P alt , an d f + (3g e P pos , it follows that for some 
a < y |3 we have that f + yg € P alt n P pos . Now the only polynomials in 

P alt nP pos are multiples of powers of x, so we find that f + y g = ex r for some 

e, r. Substituting shows that ex r + (a — y)g e P for all a. Consequently, g and 
x r interlace. This implies that g = (cx + d)x r or g = cx r_1 . The latter is not 
possible since f has degree at least r, so g = (cx+ d)x r . The result now follows 
by substituting for g. □ 

If f -C q then the largest root belongs to f, and the smallest to g. Thus, if 

f , g € P pos we have xg < f, and if f, g e P alt we know xf < g. 

Lemma 2.15. Suppose f <g have positive leading coefficients. If f, g € pv° s i nen 
xg + f < f. Iff, g e P alt then xf - g < g. 

Proof. Since g 6 P alt we know that xf < g, and therefore xf — g < g by Corol- 
lary ll.30l The second case follows by replacing x by — x. □ 

Corollary 2.16. Ifag (0,n) then 

<f a > n 

Iff e P pos (n) t/zen - af + xf <j >f a = n 

>f a < 

> f a > n 

Iff e P Qlt (n) then - af + xf \ >f a = n 

<f a < 

Proof. If f e P and a g (0,n) then -af + xf e P by Corollary [TT5] The 
remaining parts follow the usual arguments. □ 

2.4 Various linear combinations 

In this section we first look at linear combinations of the form 

h = (ax + b)f + (cx 2 + dx + e)g 

where f < g. We are interested in what restrictions we may place on a, b, c, d, e 
to conclude that h interlaces f, or that h is in P pos or P Qlt . The results are all 
easy consequences of Lemma [l.25l or or Lemma H. 201 

Theorem 2.17. Let Tf = (ax + b)f + (cx 2 + dx + e)g where c + 0, f e P pos (n) 
has positive leading coefficient, and f < g. Assume that cx 2 + dx + e has constant 
sign e on all roots off and (a + nc)e < 0/or all positive integers n. 

1. Tf<f. 

2. I/f e p(-°°- a ' and (a + nc) • (Tf)(a) >0thenTf e p(-°°- a >. 
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3. Ifi £ P' 01 ' 00 ' and (-l) n (a + nc) • (Tf)(a) > fen Tf £ p( a '°°). 

4. f/f £ P pos and (a + nc), b, e naz>e fe same sign fen Tf € P pos . 

5. 7/f £ P alt and (a + nc), b, e /zaz?e fe sign fen Tf £ P alt . 

Proof. In order to show that Tf < f we need to show that Tf sign interlaces f , 
and that Tf evaluated at the largest root of f has sign opposite to the leading 
coefficient of Tf (Lemma ll.8t . If z is a root of f, then (Tf )(z) — g(z) ■ (cz 2 + dz + 
e). Since g sign interlaces f, we must have that cz 2 + dz + e has constant sign 
on any interval containing roots (f). 

If z denotes the largest root of f, then the sign of (Tf ) (z) equals 
sgn g(z) sgn(az 2 +bz+c), and since f has positive leading coefficient, sgn g(z) 
is positive. The leading coefficient of Tf is a + nc, so a sufficient condition for 
Tf < f is that sgn(a + nc) sgn(az 2 + bz + c) is negative. 

Now assume that f £ p ( ^ 0O ' 0t) . Since Tf < f, the only root of Tf that could 
possibly be greater than a is the largest root. If (Tf)(a) has the same sign as 
the leading coefficient a + nc of Tf then there are no roots greater than a. 

In case f £ p(° c ' °) / we follow the preceding argument; the condition is that 
(Tf)(<x) has the same behavior as (Tf)(x) as x — ► — oo. 

For last two parts we use the fact that (Tf ) (0) = bf (0) + ef'(0). If f £ P pos 
then f (0) and f'(0) are positive. If f £ P alt then f (0) and f'(0) have sign (-l) n . 

□ 

The corollary below follows immediately from Lemma [l.20l 

Corollary 2.18. Suppose that f, g, h, k have positive leading coefficients and satisfy 
f<g<H,g<k. Then for all a and positive a, |3, y we have 

1. Ifg £ P pos then xf+ag + ah - |3f- yk< g 

2. If g £ P alt then xh + ag - ah + |3f + yk < g 

3. xg + (3h< g 

2.5 The positivity hierarchy 

If we are given a polynomial with all positive coefficients then there are many 
properties that it might have, and we should always check to see which prop- 
erties it does have. These properties are given in Figure I2TTI We assume that 
f = do + • • • + a n x n where all at are positive. The conditions get weaker as we 
move downward. 
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f has no real roots. 

f is positive everywhere. 



f has all real roots. 



f has exactly one real root, f is 
(linear) x (positive everywhere). 



f is stable. 

All roots are in the left 
half plane. 



Hi-2lli+2 



> 1 



The even and odd parts 
have all real roots. 



do, . . . , Q n is log concave. 



do, . . . , a n is unimodular. 

do «S • • • «S a k ^ • • • ^ a n 



Figure 2.1: The positivity hierarchy 



Chapter 




Matrices that preserve interlacing 



This chapter is concerned with the question: 

Suppose that M is a matrix, either a constant matrix, or with poly- 
nomial entries. Suppose that v is a vector of polynomials lying in 
P, or perhaps in P pos . When is Mv also a vector of polynomials in 
P? 

The answer when M is constant is roughly that M satisfy a property called 
total positivity. Such matrices are discussed in the first section, and the char- 
acterization is proved in the following section. The next section discusses two 
by two matrices that preserve interlacing, and Section 4 covers some results 
about general M. The final section has some examples of matrices that pre- 
serve interlacing for polynomials in P pos . 

3.1 Totally positive matrices 

In this section we recall properties of totally positive matrices. We will see that 
these matrices occur whenever we consider linear combinations of interlacing 
polynomials. See [5j and ||6T1 for the proofs of the theorems in this section. 

A matrix is strictly totally positive if all its minors are positive. If all the 
minors are non-negative we say it is totally positive. A simple consequence of 
the Cauchy-Binet formula for the determinant of the product of two matrices 
is that products of totally positive matrices are totally positive, and products 
of strictly totally matrices are strictly totally positive. 

A weakening of this definition is often sufficient. We say that a matrix is 
is strictly totally positive^ if all elements and all two by two determinants are 
positive. It is totally positive^ if all elements and all two by two determinants 
are non-negative. 

The first theorem is due to Whitney: 

Theorem 3.1 (Whitney). Every totally positive matrix is the limit of a sequence of 
strictly totally positive matrices. 
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The next theorem describes a useful decomposition of totally non-negative 
matrices: 

Theorem 3.2 (Loewner-Whitney). Any invertible totally positive matrix is a prod- 
uct of elementary Jacobi matrices with non-negative matrix entries. 

There are three kinds of elementary Jacobi matrices. Let Ey be the matrix 
whose i, j entry is 1 while all other entries are 0. If I is the identity matrix then 
the elementary Jacobi matrices are I + tEy + i, I + tEt+y, and I + (t — l)Ey 
where t is positive. For example, in the case i = 2 for 4 by 4 matrices they are 
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V 




\o 








V 




\o 








V 



(3.1.1) 



If ( £ a ) is strictly totally positive, then a, b, c, d > 0, and ad — be > 0. An 
explicit factorization into Jacobi matrices is 

(; 3 - (J 2) (; f) G 4-) U J) G f ) ««* 

3.2 Constant matrices preserving mutual interlacing 

Suppose that we have a vector v of polynomials in P. If A is a constant matrix 
when is Av also a vector of polynomials in P? 

Even if we are only considering three polynomials, a condition such as 
f i <C f2 <SC T3 is not strong enough to draw any conclusions. The reason is that 
when we form linear combinations, we will need to know the relationship be- 
tween f i and f3 as well. Thus, we assume that fj, T2, T3 is mutually interlacing: 

Definition 3.3. A sequence of polynomials fj, . . . , f n is mutually interlacing if 
and only if for all 1 ^ i < j ^ n we have f i <C f j , and each ft has positive 
leading coefficient. 

The roots of a mutually interlacing have a simple ordering. If we denote 
the ordered roots of f\ by r{ then the roots of f \, . . . , f n are ordered 

r{ < r 2 s$ • • • < s$ < i$ < • • • < < r?s^ • • • < < 

Here are a few examples of mutually interlacing sequences of polynomials: 

Lemma 3.4. 

1. Ifg(x) is a polynomial with roots a\ < • • ■ < a n , then the sequence below is 
mutually Interlacing: 

9 9 9 9 

x — ai' x — Q2' ' x— a n _i' x — a n 
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2. Any subsequence of a mutually interlacing sequence is mutually interlacing. 

3. Iff i, fz, . . . , f n z's a mutually interlacing sequence then the sequences below are 
mutually interlacing 



Proof. The roots of the first sequence of polynomials are 

(a 2 , a 3 ,...,a n ), {a\,a 5 ,...,a n ),..., {a\, . . ., a n _ 2 , a^J, (ai, . . ., Q n _ 2 , a n -i) 
which are interlacing. The remaining assertions follow easily from Corol- 



We first determine when a linear combination of mutually interlacing 
polynomials has all real roots. The answer is simple: the coefficients have 
at most one change of sign. 

Lemma 3.5. Suppose that a\,...,a n are non-zero constants, 

and f i <C • • • <C f tl is a mutually interlacing sequence of polynomials. The following 

are equivalent: 

1. For all possible ft as above Y_ a ih has M rea l roots. 

2. There is an integer k so that all the where i ^ k have the same sign, and all 
at where i > k have the same sign. 

Proof. We can write ft = fic + eirt where ft < th, T\ has positive leading coeffi- 
cient, and ei = — 1 if i < k and et = 1 if i > k. Now 



and by hypothesis the coefficients a^ei all have the same sign. It follows from 
Lemma[O0]that f e P. 

Conversely, consider the three mutually interlacing polynomials 



fl, f 1 + fl, fl, fl + h,..-, f n -l, fn-1 + fn, f- 

fi,fi + f 2 ,fi + h + + h + ■ ■ ■ + fn 

fl, f2, fl, fl,---,fl 



n 



laryOOl 



□ 




gi = (x-8)(x-24) 
g 2 = (x-5)(x-21) 
g 3 = (x-2)(x-13) 



where 91^92^ g3- Observe that 

gi - g2 + g3 



-21x+13^P. 
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Now embed gi, g 2 , 93 in a mutually interlacing sequence so that fj = 
(l/|ai|)gi,fi = (l/|ai|)g 2 , and f n = (l/|a n |)g 3 . Since 

aifi + Q|fi + a n f n = sgu(ai)gi + sgn(a0g 2 + sgn(a n )g 3 

we know that if the left hand side has all real roots then we can not have that 
sgn(ai) = sgn(a n ) 7^ sgn(cii). This implies that there can be atmostone sign 
change. □ 

The following obvious corollary will be generalized in the next section 
where sums are replaced by integrals. 

Corollary 3.6. Suppose that a\, . . . , a n are positive constants, 

and f i -C • • • -C f n is a mutually interlacing sequence of polynomials. Then 

f 1 <Q 1 f 1 + --- + Q n f n <f n (3.2.1) 

Our first theorem is a simple consequence of the general theorems of the 
previous section. 

Theorem 3.7. Suppose i\, . . . , f n are mutually interlacing, and A = (ay ) is an n 
by n matrix of constants such that 

A- (fi,...,f n ) = (gi,...,g n ) 

If A is totally positive, then gi, . . . , g n is mutually interlacing. 

Proof. Since the limit of a mutually interlacing sequence is mutually interlac- 
ing, and any non-invertible totally positive matrix is the limit of strictly to- 
tally positive matrices, we may assume that A is invertible. By the Loewner- 
Whitney theorem we can assume that A is an elementary Jacobi matrix. The 
three classes of matrices applied to fi< ■■■ <f n give three easily verified 
mutually interlacing sequences: 

f 1 • • • fi-l fx + tfi+i fi+l • • • f n 

fl ••• fi-l fi tfi + fi+i ••• f n 

fj • • • fi_i tfi fi+l ' ' ' fn 

□ 

The two by two case for all positive entries follows from the theorem. 
However, we can determine the action of an arbitrary two by two matrix. 

Corollary 3.8. Suppose that fi <c f 2 (or fi < f 2 ) have positive leading coefficients, 
and ( " d ) ( % ) = ( 92 )' wnere without loss of generality we may assume that a is 
positive. If the determinant ad — be is positive then 0,1 <c g 2 , and if it is negative 
92<gi- 
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Proof. Assume that the determinant ad — be is non-zero. If r is a common root 
of gi and Q2 then since the matrix is invertible we conclude that f 1 (r) = f 2 (r ) = 
which contradicts the assumption that fj and f2 have no common factors. 
Now any linear combination of gi and g2 is a linear combination of fi and T2, 
and thus has all real roots. Consequently, gi and g2 interlace. The direction 
of interlacing will not change if we continuously change the matrix ( £ a ) as 
long as the determinant never becomes zero. Consequently if the determinant 
is positive we can deform the matrix to the identity in which case gi <C g2- If 
the determinant is negative we can deform the matrix to ( \ ) , and since 
f — g <c f we are done. □ 



•lb' 
.Id, 



(2) 



f + bg 
f+dg 



If f < g and < b < d then 



Example 3.9. Consider 
f + bg<f +dg. 

The examples of mutually interlacing sequences in Lemma l3~4T 3) are given 
by the following totally positive matrices, where we take n = 3 for clarity and 
the "." are zeros. 



A • A 

1 1 . 

. 1 . 

. 1 1 

• • V 




f 1 ■ 
1 

1 

V- 1 



•7 



An immediate corollary of Corollary l3.8l is that the inverse of a totally pos- 
itive matrix sometimes preserves interlacing. 

Corollary 3.10. If 

1. Mis a 2 by 2 totally positive invertible matrix with non-negative entries 

2. f, g,u, v 6 P have positive leading coefficients 

3. M( g ) = m 

then f <c g if and only zfu<C v. 

Remark 3.11. The inverses of the 3 by 3 Jacobi matrices (except the diagonal 
ones) do not satisfy the conclusions of the lemma. However, we do have that 
the matrices of the form 

1 -aS 
1 
-b 1 

satisfy the conclusions if ^ ab < 1. That is, if ^ a, b and $5 ab < 1, 
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f , g, h. is mutually interlacing, and f, g, h, u, v, w have positive leading coeffi- 
cients then u, v, w is mutually interlacing. This is equivalent to showing that 

f-ah<g g<-bf + h f— ah<h— bf 

The first follows from f <c g <f; h. For the second, write f = ag — r, h. = 8g + s 
where g <^r, s. Then g^C ((3 — ba)g + s + br since the leading coefficient of 
the right hand side is positive by hypothesis. Finally, the last one is a two by 
two matrix ( _\, i a ) ( k ) ~ ( H-bf ) • ^ rie l emma shows that the interlacing is 
satisfied if the determinant is positive, which is true since ^ ab < 1. 

Surprisingly, we do not need the full force of total positivity to conclude 
that a matrix preserves interlacing, but just the non-negativity of all two by 
two subma trices. 

Theorem 3.12. Suppose that A = (qj, . . . , a n ) and B = (bi, . . . , b n ) are vectors 
of positive real numbers, and f = (fi, ...,f n ) is a vector of mutually interlacing 
polynomials. Then 

Af <f;Bf for all f if and only if (g) is totally positive 2 . 

Proof. Consider the mutually interlacing set of polynomials 

Cif i, C 2 f 2 , . . . , Cnfn 

where the Ct are positive. By continuity of roots, if we set Ci = Cj = 1, and all 
other c's to zero, then 

Qifi + Qjfj^bifi + bjfj. 

M - fh 



We can rewrite this as ( b ^ b ! J ( f M = ( k ) where h^k. By Corollary l3.8l we 



must have b J b ' > 0. 

Conversely, given A and B construct a totally positive n by n matrix C 
whose first row is A, second row is B, and whose remaining rows are all 0. If 
we apply C to the mutually interlacing sequence f, g, g, • • • , g the conclusion 
follows from Theorem |3.7| □ 

A small modification of this argument shows 

Corollary 3.13. Suppose fj, ...,f n are mutually interlacing, and A = (ay) is an 
nbym matrix of constants such that 

A- (fi,...,f n ) = (gi, • • • , g m ) 

If A is totally positive2, then gi, . . . , g m is mutually interlacing. 

If we fix the constants rather than the polynomials, then we need a deter- 
minant condition on the polynomials. 
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Lemma 3.14. Suppose that we have the following interlacing diagram 



< h 



A 



A 



gi < 92 < 



< fn 

A 

< gn 



where 



1. The ft are mutually interlacing. 

2. The gi are mutually interlacing. 

3. | £ | < 0/or 1 < i < n 

then for any non-negative <x\, . . . , cc n we have Y (Xjt~i<C ^ ai gi 



Proof. We may assume that all at are positive. We proceed by induction. The 
case n = 2 is Lemma 11.511 let hi = od\ + ourfi and ki = ctgi + a^Qi then 
by the lemma we know that hi <c ki and {hi, i^, ■ ■ ■ , f n } and {ki, Q2, ■ ■ ■, Qn} are 
mutually interlacing. The determinant condition is satisfied since 



hi 


f 2 




aifi 


+ <x 2 h 


h 




fl 


h 


ki 


92 




aigi 


+ U-292 


92 


= Otl 


91 


92 



< 



We continue combining terms until done. 



□ 



Mutually interlacing sequences have a nice interpretation when expressed 
in the interpolation basis. Suppose f i, . . . , f n is a sequence of mutually in- 
terlacing polynomials of degree d with positive leading coefficients. All the 
smallest roots of f 1, . . . , f n are smaller than all the second smallest roots of 
fi/ ■ • • / fn/ and so on. Thus, we can always find a g such that g < fi for all i. It 
follows that there are non-negative constants ay such that 



If we set A = (ay), F = (fi, . . ., f n ) and G 



then AG 



The next lemma characterizes those A's that always yield mutually interlacing 
sequences. 

Corollary 3.15. Suppose G and A are as above. For all choices ofG the sequence AG 
is mutually interlacing if and only if A is totally positive2- 

Proof. We simplify what needs to be proved. First of all, it suffices to assume 
that A has only two rows since the conclusion only involves two by two ma- 
trices. Thus we will show that if for all g we have 
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then | c{ ci+J I ^ 0. Next, notice that we can multiply any column of A by any 
positive number, and the determinants will still be non-negative. By continu- 
ity we can set all other columns to zero, and so we assume that 

g(x) . gW ^ g(x) g(x) 
cm. 7- + ai+i — <c t — + Ci + i- 



bi x-bi+i x — bi x-bt+i 

Next, we choose g = x n ~ 2 (x — a)(x — |3) where bi = a and bi+i = (3, and 
(3 > a. We can cancel the powers of x, and so we have 

di(x— |3) + a i+ i(x — a)<Ci(x— |3) +Ci + i(x— oC] 

The root of the left hand side is not less than the right hand side, so 

Ci + ia+Ci|3 at+joc + ai|3 

Ci + Ct+l ^ CLi + 

and from this we find that 

(a- (3)(a i+1 Ci - OiC l+ i) > 

which is the desired conclusion. 

The converse follows from Corollary l3.13l since the set | j is mutually 
interlacing. □ 

The following property of mutually interlacing polynomials general- 
izes Lemma 11.191 and will be extended to integrals in the next section 
(LerrtmaElJ. 

Lemma 3.16. Ifi\, ■ . . , f n and gi,...,g n are two sequences of mutually interlacing 
polynomials with positive leading coefficients, then 

f 1 9n + h Qn-i H h f n gi fc i ■ 

Equivalently, ^fi(x)gi(— x) e P. 

Proof. Assume the f 's have degree r and the g's have degree s. We may assume 
that the roots are distinct. It is helpful to visualize the location of these roots 
in two dimensions, so we plot the roots of ft and g n +i-i with y coordinate i. 
For instance, if r = 3, s = 2, n = 4 then a possible diagram of the roots is 

The roots of the f 's lie in three increasing groups, because each fi has three 
roots, and the fi are mutually interlacing. Since the g's occur in the opposite 
order, their groups are decreasing. In addition, since the f's are mutually in- 
terlacing, each group of four roots on a line does not vertically overlap any 
other group of four roots. 

In this diagram the dotted lines represent values of x for which each of the 
terms f i g n , • • • , f n gi have the same sign slightly to the left of x. To see this, 
consider a term fi g n +i-i- At any value x and sufficiently small e the sign of 
fi(x — e) 9n+i— i(x — e) is (—1) to the number of roots to the right of x. The 
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Figure 3.1: Roots of mutually interlacing polynomials 



doted lines arise at the intersection of upward and downward lines, or at gaps 
where there are no up or down lines. 

The general case is no different. There are n levels, r upward lines, and s 
downward lines. It is easy, but notationally complicated, to see that there are 
r + s + 1 vertical lines where all signs are the same. Between each pair of signs 
there is a root of the sum, accounting for all the roots. 

The final statement follows from the first since the interlacing direction of 
{gi(— x)} is the reverse of the interlacing direction of {gt(x)}. □ 

3.3 Determinants and mutual interlacing 

If f i, . . . , f d is a sequence of mutually interlacing polynomials of degree n, 
and f i = Y. a i,j x ' men me matrix (cid-ij) is called the matrix of coefficients of 
f i, . . . , f d- If the matrix of coefficients of a sequence of mutually interlacing 
polynomials is TP k (or totally positive), we say that the sequence is TP k (or 
totally positive). We show that any mutually interlacing sequence is TP 2 , but 
not necessarily TP 3 . In addition, we show that certain mutually interlacing set 
of polynomials are totally positive. 

Example 3.17. Here are three mutually interlacing polynomials 

fi = (2 + x) (9 + x)(14 + x) 
f 2 = (5 + x)(10 + x)(15 + x) 
f 3 = (6 + x)(12 + x)(18 + x) 

The matrix of coefficients is 

1296 396 36 1\ 
750 275 30 1 
252 172 25 l) 

It's easy to see that it is TP 2 , but the four three by three determinants are all 
negative, so it is not TP3 . 

We first have two useful facts. 

Lemma 3.18. 
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1. Suppose that C is the matrix of coefficients ofV = (f i, f2, • ■ ■ , f n )/ an d M z's a 
compatible matrix, then the matrix of coefficients of MV z's MC. 

2. f/g e pv° s r an d the mutually interlacing sequence V = (fi, . . . , f d) is totally 
positive, then the sequence (g fi, g f 2 , • • • , g f n ) is a/so totally positive. 

Proof. The first is immediate from the definition. For the second, since we can 
factor g into positive factors, it suffices to assume that g = x + a, where a > 0. 
If f i = Y_ a i xl an d f 2 = Y. bt* 1 men tne matrices of coefficients satisfy 



a ai a 2 
b bi b 2 



/a 1 

a 1 
a 1 

V 



7 
'aao 
a b 



a ai + ao a Q2 + ai 
abi + b ab 2 + bi 



The first matrix is the matrix of coefficients of V, the second is totally pos- 
itive, and the third is the matrix of coefficients of (x + a)V. The conclusion 
follows. □ 

If f = Yl (x + n) where r\ < T2 < • • • < r n then we say that the sequence 

f(x) f(x) f(x) 



x + r n x + r n _i 



x + Tj 



of mutually interlacing polynomials is determined by f . 

Proposition 3.19. If V = (fi, . . . , f n ) is a sequence of mutually interlacing polyno- 
mials determined by f e P pos , and M is a totally positive matrix, then the matrix of 
coefficients of MV is totally positive. 

Proof. Since the matrix of coefficients of MV is M times the matrix of coeffi- 
cients of V, it suffices to show that V is totally positive. We thus may assume 
that ft = f/(x+r n+ i_i), where f(x) = (x+r\) ■ ■ ■ (x+r n ) andO < r\ < ■ ■ ■ < r n . 
The matrix of coefficients is an n by n matrix. Consider the submatrix deter- 
mined by a set R of d rows and a set of d columns. Define 

g{x) = Yl[x + ry.) h{x) = Y\(x + r v ) 

k0R kGR 

Note ft = g(x) • h(x)/(x + n). We want to prove that all the determinants of 
size d by d are positive for the following set of polynomials 



{g(x)-h(x)/(x + rO IkeR} 
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By the lemma, it suffices to show that the determinant of the matrix of coeffi- 
cients of 

{h(x)/(x + r k ) |keR} 

4 and R = {1,2,3,4} then the matrix of coeffi- 



is positive. For instance, if d 
cients is 

/ T 2 T3 T 4 

1*1 r 3 r 4 



n r 2 r 4 
\ri r 2 r 3 



r 2 r 3 + r 2 r 4 + r 3 r 4 
n r 3 + ri r 4 + r 3 r 4 
rj r 2 + rj r 4 + r 2 r 4 
n r 2 + n r 3 + r 2 r 3 



r 2 + r 3 + r 4 
T\ + r 3 + r 4 
ri + r 2 + r 4 
ri + r 2 + r 3 



i 
i 

V 



First of all, the determinant is a polynomial of degree (2) in the r^. If = r, 
then two rows are equal, and the determinant is zero. Thus, n — r j divides the 
determinant. Consequently, 



A 



it 



divides the determinant. But this is a polynomial of degree (2), so the de- 
terminant is a constant multiple of A. We can check that that the constant of 
proportionality is 1. Since < rj, all terms of the product in A are positive, so 
the determinant is positive. 

□ 

If the matrix of coefficients of a sequence V of mutually interlacing poly- 
nomials is not totally positive, then V is not obtained by multiplying the se- 
quence determined by the factors of some polynomial by a totally positive 
matrix. For instance, the polynomials in Example 13 . 1 71 form such a sequence. 

Lemma 3.20. Iff 1, . . . ,f & is a sequence of mutually interlacing polynomials in P v ° s 
then the matrix of coefficients is TP 2 . 

Proof. It suffices to prove that all two by two determinants are non-negative 
for f <g. Write g = af + Y. a if/( x + r i) where f = YI( X + r i)' and all ck are 
non-negative. If we write this equation in terms of matrices 



1 

a oci a 2 



( f \ 

f/x + T n 



we see the matrix on the left is totally positive, and the middle matrix is totally 
positive by the Proposition and taking limits, so the conclusion follows. 

□ 



We will prove this same result in a different way in Corollary 14.311 If we 
apply the Lemma to xf and f where f = Y. a i xl G P pos , then the matrix of 
coefficients is 

'do ai a 2 
dp ai 
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The two by two determinants show that a| ^ a^-ia^+i. This is a special case 
of Newton's inequality (p. 11061 . 

3.4 Interlacing polynomials and the Hurwitz matrix 

A pair of interlacing polynomials determines a totally positive matrix. We 
prove it inductively; the key step is given in the following lemma. 

Lemma 3.21. Define T c (f, g) = (xg + cf, f ), and 

D — {(f, g) | f < g or f -C g in P v ° s with positive leading coefficients }. 
1-1: T c : D — > D is one to one. 

onto: Given <x&D there is a positive c and |3 € T> such that T c ( (3 ) = a. 

Proof. The first part is clear. For the second choose (h, k) e D, and set c = 
h(0)/k(0) and g = (h — ck)/x. Since h and ck have the same constant term, g 
is a polynomial. If h < k then gx = h — ck < k so T c (k, g) = (h, k) and k<C g. 

If h <C k then write h = ak — I where a is positive and h < i and follow the 
previous argument to conclude that k < g. 

□ 

If we iterate the lemma, we see that we can find positive Ct and positive b 
such that 

(f,g)=T Cl •••T Cn H(b,0). (3.4.1) 

This factorization can be turned into a factorization of matrices. If f = Y. ck*^ 
g = Y. biX 1 and we define 



fc 


i 














..A 




/ Qo 


Ql 


Q 2 




\ 








1 


















bo 


bi 


b 2 ... 










c 


1 










H(f,g) = 





a 


Ql 


a 2 ... 
















1 













bo 


bi b 2 
















c 


1 












Qo 


CLi Q 2 




V 












) 




V : 











(3.4.2) 

then T(c)H(f, g) = H(xg + cf, f). The matrix H(f, g) is sometimes called the 
Hurwitz matrix. 

Proposition 3.22. 2ff < g inP pos have positive leading coefficients then 

1. H(f, g) is totally positive. 

2. There are positive ct and b such that 



H(f,g) =J(ci)---J( Cl JH(b,0). 
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Proof. Since all J(c) and H(b,0) are totally positive, the first statement is a 
consequence of the second. The second is a consequence of the factorization 
1 13.4. It and the lemma. □ 

Remark 3.23. If we write out the matrix determined by f < f ' 



/ Qi 2Q2 3d3 

a ai a 2 0.3 ... 

ai 2a2 3a3 . . . 

ao ai Q2 Q3 



\ 



/ 



then we see that I 



^ 0. This is equivalent to 



2 . lc + l 
u k > — ; — a k _ia k+ i 



and this is a weak form of Newton's inequality (See p. 11061 ). 

We will prove the following corollary in a very different way later - see 
Theoremi2H] 

Corollary 3.24. Ifi — Y. Q i xl zs ZM P pos then the matrix below is totally positive. 

^q a! q 2 . . ^ 

a Qi a 2 
a ai 



Proof. Apply the proposition to f <f and select the submatrix whose rows 
come from f . □ 



3.5 The division algorithm 

We study the division algorithm applied to interlacing polynomials. The basic 
result is the following lemma: 

Lemma 3.25. Suppose that f, g have positive leading coefficients, and f < g in P v ° s . 
If we divide f by g 

f = (ax + b)g — r where deg(r) < deg(g) 

then 

1. g<r. 

2. re P vos 

3. r has positive leading coefficient. 
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4. deg(r) = n - 2 

Proof. Since f sign interlaces g, it follows that r sign interlaces f . The degree 
of r is less than the degree of g so we have that g < r. The interlacing is strict, 
since any common factor of g and r is a common factor of f and g. Since f 
strictly interlaces g, they have no common factors. Since g G P pos , so is r. 

Now we determine the leading coefficient of r. We may assume that f, g 
are monic, so 

f = x n + aix^ 1 + a 2 x n - 2 + ■■■ 

g = x™- 1 + b lX n - 2 + b 2 X n ~ 3 + ■ • • 

r = (x + ai — bi)g — f 
= (a 1 b 1 +b 2 -b 2 1 -a 2 )x n - 2 + --- 
Using Proposition 13 .221 and reversing f, g we see that 



s$ 



1 bi b 2 

1 di d2 

1 bi 



dibi + b 2 — b 2 — d2 



Because r interlaces g the degree of r is n — 2, and so the leading coefficient is 
positive. □ 

Remark 3.26. If we apply the division algorithm to f < f ' where f e P pos then 
all coefficients of r must be positive. The resulting inequalities are either New- 
ton's inequalities, or simple consequences of them. 

It's curious that the lemma does not depend on the positivity of f and g. 

Lemma 3.27. Suppose that f, g have positive leading coefficients, and f < g in P. If 
we divide f by g 

f = (dx + b)g — r where deg(r) < deg(g) 

then 

1. g<r. 

2. r has positive leading coefficient. 

3. deg(r) = n-2 
Proof. If we write 

n n 



where oct ^ then a computation shows that the coefficient of r is 

- t — ~ 1 _ L ^ X ata S (ti ~ r i )2 

(«! + •••+ a n ) 2 f- 



□ 
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Example 3.28. We can apply this in the usual way to solve Ff — Gg = 1. For 
example, 

f = (x + l)(x + 3)(x + 6) 

g = (x + 2)(x + 4) 

f =(x + 4)[g] - [5x + 14] 

and recursively solving we get 

1 = 39 + ^ + 5x2 [(x + 2) (x + 4)] - [( x l!(x : 3)(x : 6] 




Notice that in this example F and G are in P pos . In general, if f has degree n 
then the algorithm yields F, G with deg(F) = deg(G) — 1 = deg(f) — 2. 

Lemma 3.29. J/f < g in P pos , deg(F) = deg(G)-l = deg(f)-2, and fF-gG =1 
then 

1. F,G e P vos . 

2. f < g, G and g, G < F 

Proof. If a is a root of f then g(a)(G(a) = 1 so G and g have the same sign 
at roots of f. Since deg(G) = deg(f) — 1 it follows that f < G - in particular, 
G e P. The same arguments show G < F and g < F. All interlacings are strict 
since any common divisor divides 1. □ 

The identity fF — gG = 1 is known as the Bezout identity. 
Corollary 3.30. Iff < g in P v ° s then there exist F, G e P poB such that 

1. fF-gG = 1 

2. f < g, G and g, G < F 

Restated in terms of the matrix ( £ | ) 

Corollary 3.31. A strict interlacing pair (f, g) in P v ° s can be extended to an inter- 
lacing matrix of determinant 1. 

Example 3.32. If we assume that the anti-diagonal elements are (x — a) (x — b) 
and 1 then there are exactly two solutions to the Bezout identity: 

■ x _ a + b±V(a-b)^ (x _ Q)( x _ b) 

j x _ a+b±V(a-b) 2 -4 

Here is a family of matrices satisfying the Bezout identity 

x + 2 kx 2 + 2kx + 2x + 3 
1 kx + 2 
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Remark 3.33. If M is a matrix as in the lemma, then M and M _1 both map in- 
terlacing polynomials to interlacing polynomials. (See p. |83j) This is different 
from linear transformations on one polynomial, where the only linear trans- 
formations T for which T and T~ : map P to itself are affine transformations. 

Next we show that all elements of SI_2(R) can be realized by evaluating 
polynomials. 

Lemma 3.34. If I " ^ I —1 then there are f, g, h, k e P{n) such that 



f g 

h k 



fUO) g(0)\ fa b 
\h(0) k[0)J \c d 



Proof. Define 

S is the set of all ( 7 ? ) such that | 7 ? | =1 and all polynomials in 
the same row or column interlace. 

Note that if | £ ^ | = \ anc [ polynomials in one row or column interlace then 

( h k ) e ^- ^ e now c l a i m mat multiplication by ( p 1 ° a J and ^ * ma P ^ 
to itself. For instance, 

a \ /f g\ _ / af ag 
(3 1/aj ^h kj _ ^|3f + h/a (3g + k/a 

The resulting matrix has determinant one, and one interlacing row, so it is in 
S. 

From Corollary I3.30l we know that S is non-zero, so we choose ytt) m S. 

The element f ^ is in SL 2 (M). Since SL 2 (R) is generated by upper and 

lower triangular matrices as above, all elements of SI_2(]R) have the desired 
representation. □ 

Following the usual matrix representation of the Euclidean algorithm we 
get a unique representation for interlacing polynomials. 

Lemma 3.35. If f < g have positive leading coefficients then there are unique positive 
a\, . . . , a n and positive a such that 

f \ / a n x + b n — 1\ / aix + bj — 1\ / 1 
a 



v gy V 1 / V 1 / v° 

Proof. Simply note that if f = (ax + b)g — r then 



□ 
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3.6 Integrating families of polynomials 

We can extend the concept of a mutually interlacing sequence f \, . . . , f n of 
polynomials to an interlacing family {ft}, where t belongs to some interval. 
Most results of section [3~3l have straightforward generalizations, where sum- 
mation is replaced by integration, and totally positive matrices are replaced 
by totally positive functions. 

We first introduce a quantitative measure of root separation, and use it to 
integrate families of polynomials. If f G P, then 6(f) is the minimum distance 
between roots of f . If f has repeated roots then 6(f) = 0, and so f has all distinct 
roots if and only if 6(f) > 0. It is easy to see that an equivalent definition is 

6(f) = largest t such that f (x) < f (x + t) 

It easily follows from the definition of 6(f) that 

O^s^tsC 6(f) =>■ f (x + s) < f (x + 1) (3.6.1) 
This last property is the motivation for the following definitions. 

Definition 3.36. Suppose that I is an interval, and for each t S I we have a 
polynomial ft(x) with all real roots such that the assignment t i — ^ ft is con- 
tinuous. We say {f t } is a continuous family of polynomials. If f t -C f s for every 
t ^ s in I, then {f J is an interlacing family . The interval of interlacing at a point v, 
written p(f t/ v), is the largest e such that f v -C f « for all s G (v,v + e). If there 
is no such e we set p(f t ,v) = 0. A continuous family of polynomials {ft} is 
locally interlacing if p(f t , v) is positive for all v in the interior of I. The interval 
of interlacing of a locally interlacing family, p(ft), is the largest e such that if 
s,v G I and v < s < v + e, then f v <Cf s . If there is no such e then the interval 
of interlacing is 0. 

The following is a generalization of Corollary |3.6l 

Proposition 3.37. Suppose {f t } is a continuous family of interlacing polynomials on 
[a, b], and that w(t) is a positive function on [a, b]. The polynomial 



VW 



b 



f t (x)w(t)dt (3.6.2) 



has all real roots, and fq^Cp^Cfb- 

Proof. Choose a = to < tj < • • • < t n = b, and consider the function 
s(x,t) =ftoMw>(t)xrt ,t 1 ] H l-ft n _ 1 Ww(t)x[t n _ 1/ t n ] 

where X[c,d] is the characteristic function of the interval [c, d]. It suffices to 

show that the polynomial s(x, t) dt has all real roots, since 113.6.21 is the 
limit of such functions. Note that 



b 



s(x,t) dt = f 



in 



w(t) dt + • • • + f t „ 



w(t) dt 
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Each of the coefficients of the polynomials f ti is positive, and the set of poly- 
nomials ft , . . -/ft^ is mutually interlacing since {ft} is an interlacing family, 
so s(x, t) dt has all real roots by Lemma [3741 □ 



Example 3.38. Since 6(x] n = 1/ if we let w(t] = 1 then (x_+_t) (n) dt is in P. 
More generally, for any w(t) that is positive on (0, 1) we have 



(x±t) (n) w(t)dt<(x + l) (n) . 



Example 3.39. Given any mutually interlacing sequence of polynomials we 
can find an interlacing family that contains it. Suppose that f 1 < ■ ■ ■ < f n is 
a mutually interlacing family. Define an interlacing family on (l,n) such that 
I i ft by 

F t = (i + 1 - t)fi + (t - i)f i+1 for i < t < i + 1. 



We can easily compute the integral 

n -i r i+l 



Ft dt = Y_ 



(i + l-t)fi + (t-i)f l+1 dt 



i=l " 
n-l 



i=l 
= |fl + f 2 



• • + fn-1 + ^ f r 



Here is a similar example, where the sequence of mutually interlacing 
polynomials is determined by the roots of a fixed polynomial. Suppose that 
f(x) = (x — ti) • • • (x — r n ) where ri < • • • ^ r n , and let ft = f/(x — ri). We 
know that f n <C • • • <C f i is a mutually interlacing sequence. Define an inter- 
lacing family on (ri, r n ) by 



Ft = 



f(x) 



(x - n - r i+1 + t) on Tt < t r i+ i 



(x-rtjix-n+i) 

This family satisfies F Ti = ft. The integral is equal to 



n-l 



Ft dt = Y_ 



i=l 
n-l 



f(x) 



x-rt)(x-r i+1 J 
f(x) 



(X - Tt - T i+ i + t) 



t=l 



(x-rt)(x-r i+ i] 



(x-(n+r i+a )/2) 



The integral interlaces f, since the integral is a positive linear combination of 
polynomials interlacing f . 
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In neither of these examples is F t differentiable; the constructions involve 
piecewise linear functions. In Lemma [l 1 . 1 8l we will find a differentiable family 
for the sequence in the second example. 

The next result is the continuous analog of Lemma l3.14l The proof is omit- 
ted. 

Lemma 3.40. Suppose that f t and g t are interlacing families on [a, b]. If 
1. f t (x)<g t (x)/oraZZa<t<b 



2. 

then 



f(s) f(t] 
g(s) g(t) 



< Ofor a < s < t < b 



f t (x) dt< 



g t (x) dt 



The conclusion of Proposition 13.371 was only that p weakly interlaced f Q 
and fb, since the limit of interlacing polynomials is only weakly interlacing 
polynomials. The next lemma is used to deduce strict interlacing for integrals. 

Lemma 3.41. Suppose {f t } z's a continuous family of interlacing polynomials on 
[a,b], and that w(t) is a positive function on [a,b]. Choose C\,Ci such that 
a < ci < C2 < b and consider the the polynomials 



Pi 

P2 



f t (x) w(t)dt 



f t (x) w(t)dt 



Then, f a <pi <P2 <f b . 

Proof. From the previous theorem and the hypothesis we know that 

fa<Pl<f Cl <fc 2 <P2<fb 

Since {ft} is interlacing on [a, b], we see that pi -C P2- 



□ 



We can integrate locally interlacing families to get new ones. We first hold 
the interval constant. 

Proposition 3.42. If {f t } is a locally interlacing family on [a,b], w(x) positive on 
[a,b], andr ^ p(f)/2, then 



QsW 



s+r 



f t (x)w(t) dt 



z's a locally interlacing family on [a, b — r] with p(g s ] ^ r. 



CHAPTER 3. MATRICES THAT PRESERVE INTERLACING 



78 



Proof. Certainly all members g s have roots, so it remains to show that there 
is local interlacing. Consider g s and g s +e/ where < e ^ r. By the previous 
lemma, the three integrals 



Pi 

P2 
P3 



ps + e 

f t (x)w(t) dt 

s 

■s+r 

f t (x)w(t) dt 

s + e 
■s+r+e 

f t (x)w(t) dt 

S+T 



satisfy pi <p 2 <P3 and pi <p 3 . Since g s 
follows that g s «g s+e . 



pi +p 2 , and g s+e 



P2 + P3, it 
□ 



Next, we vary the domain of integration. 

Lemma 3.43. J/{f t (x)} is a locally interlacing family on 
function, and 



w(x) a non-negative 



. f t (x)w(t)dt s>0 
9sW = \ Jo 

,fo(x) s=0 
then {g s } is a locally interlacing family with p(g) > p(f ). 

Proof. We need to show that g u < g v for s$ u v ^ p(f). If we consider an 
approximating sum for g v then all the terms are mutually interlacing, which 
easily implies the result. □ 

If the weight is a function of two variables, then integrating a locally inter- 
lacing family against the weight gives a new family whose members all have 
roots. However, we need an extra condition to guarantee that it is locally in- 
terlacing, and total positivity is what we need. 

Lemma 3.44. Suppose that {f t } is a family of interlacing polynomials on [a, b], and 
u(x),w(x) are positive on [a, b]. Set 



Pu 

Pw 



f t (x) u(t) dt 



u(s) u(t) 
w(s) w(t) 



f t (x)w(t) dt. 
is positive for all a ^ s < t ^ b, then p 1L <c Pi 
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Proof. For any a < ti < ■ ■ • < t n < b, the approximations to p u and p w are 

n-l 



Now since 



tt+i 



w(t) dt 



z 

i=l 
n-l 

z 

1=1 



ti+1 



u(t) dt f ti 



Li 



w(t) dt f tl 



t)+i 



uft) dt 



■ti+i 



•tj+i 



w(t) dt 



tt+i 



uft) dt 



(w(t)u(s) - u(t)wfs)) dt ds > 



and the sequence of polynomials f tl , . . . , f trl is mutually interlacing, the result 
follows from Theorem l3.12l Note that we can allow w and u to have infinitely 
many zeros on [a, b] since all the integrals will still be positive. □ 



Definition 3.45. A function w(s, t) is strongly totally positive of order 2 (or STP2) 
on an interval [a, b] if for all t < t', s < s' in [a, b] the determinant 



w(s,t) w(s,t') 
w(s',t) w(s',t') 



and all its entries are positive. For example, e xy is STP2. 

Corollary 3.46. Suppose that {f t } is an interlacing family on [a, b], and that w(s, t) 
z's STP2. The family of polynomials 



f t (x)w(t,s) dt 



z's a interlacing family. 



Proof. The lemma shows that it is locally interlacing, and it follows that it is 
an interlacing family since f a ^gs fCfb- d 



As an example, consider the family 



QtW 



Since 6(x + y) — 1 and e ty is STP2 we know that gt is an interlacing family. 
Since gt is defined for all t, we have an interlacing family on M. 
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3.7 Families, root zones, and common interfacings 

In this section we consider families of polynomials that are indexed by a set S. 
We say that a family {f-tjtes has a common interlacing if there is a g £ P such 
that f t < g for all t € S. The basic question is 

What properties do we need in order to have a common interlac- 
ing? 

Here's an example. Consider the family {F t } t ^o where F t (x) = f (x) + tg(x) 
and deg(f) = deg(g) +1. If F t € P for all positive t then there is a common 
interlacing. This is Proposition ll.351 

It is useful to consider all the zeros of a family. Suppose that {f t }tes is a 
family of polynomials of degree n. The k'th root zone is 

A k = {w e K I 3t e S & w is the k'th largest root of f t } 

Since we assume that families are continuous, it follows that if S is con- 
nected then all the root zones are intervals. The following is elementary, but 
points out how root zones can be useful. 

Lemma 3.47. If a family F consists of polynomials in P of degree n, and all root zones 
Ai, . . . , A n are disjoint then F has a common interlacing. The family has a common 
interlacing iff the intersection of any two root zones has at most one element. 

Proof. Pick the roots of a common interlacing to lie between consecutive root 
zones. □ 

Corollary 3.48. If any two polynomials of a family have a common interlacing, then 
the family has a common interlacing. 

Proof. The hypotheses imply that all elements in A k are less than or equal to 
all elements of A k+ i. □ 

Example 3.49. The root zones of the family {f + tg} t ^o mentioned above have 
a simple description. Suppose that f and g have degree n, and let roots (f) = 
[xx] and roots(g) = (si). Since f, g have a common interlacing, it follows that 
the root zones are 

A k = [min(r k ,s k ), max(r k; s k )]. 

If all members of the family {f + ag} a ^o have all real roots then the family 
has a common interlacing. We can rephrase this in a way that leads to an 
important generalization: 

If |3 2 f + g G P for all real (3 then {|3 2 f + g}peR has a common inter- 
lacing. 

The next lemma adds a linear term. 

Lemma 3.50. Suppose that f, g, h are polynomials with positive leading coefficients 
such that deg(f) = deg(g) = deg(h) + 1 
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If |3 2 f + (3h+ g G P for all real |3 f/ien {(3 2 f + (3h + g}p S K /zas a common 
interlacing. 

Proof. Define F(x, (3) = |3 2 f + |3h + g. If deg(f) = n then for each (3 the polyno- 
mial F(x, (3) has exactly n roots, since F(x, (3) has degree n for all (3. If rt((3) is 
the i'th largest root of F(x, (3) then the continuous function (3 i — > Ti ( [3 ) is called 
the i'th solution curve. 

The i'th solution curve meets the x-axis in the i'th largest root of g since 
F(x,0) = g. In addition, lirri|p|^ 00 Ti(P) is the i'th largest root of f. Thus, the 
i'th solution curve is asymptotic to the vertical line through the i'th largest 
root of f, and meets the x-axis in the i'th largest root of g. 

Consideration of the geometry will finish the proof. For any xo there are 
at most two points on the graph with x-coordinate Xq. There can be no verti- 
cal line that meets two solution curves, for then there would be at least four 
intersections on some vertical line. See Figure |3~51 where the dashed lines are 
the vertical asymptotes. Thus, the root zones are separated, and we can apply 



Corollary 3.51. Suppose that f , h are two polynomials satisfying 

1. f G P has positive leading coefficient. 

2. h has degree less than n, and does not necessarily have all real roots. 

3. f+ahe Pfor O^cc^l 

then {f + ah}o^ as ;i has a common interlacing. 
Proof. Consider the following 

f + aheP if |a| ^ 1 



Lemma 1^471 




Figure 3.2: Impossible solution curves 



□ 




if |3 e R since 



|3 2 + 1 



2P 



< 1 



(3 2 f + 2(3h+f G P 
We can now apply the lemma. 



if |3 G R 



□ 
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3.8 Two by two matrices preserving interlacing 

When does a two by two matrix with polynomial entries preserve interlacing? 
First, a precise definition. 

Definition 3.52. Suppose that M is a two by two matrix with polynomial en- 
tries, and M ( \ ) = ( r s ). Then 

• M preserves interlacing if f < g or f <c g ==> r<sorr<Cs. 

• M preserves interlacing for < if f < g => r < s or r <C s. 

• M preserves interlacing for <c if f <C g => r<sorr<Cs. 

• M weakly preserves interlacing if all interlacings are < or <C ■ 

A number of earlier results can be cast as matrix statements. Assume that 
a, (3 are positive. 

f < — g ==>■ af < — |3g (op) preserves interlacing. 

f < — g =>■ xf < — xg ( o x ) weakly preserves interlacing. 

f < g xf — ag < (3f ( p "o" ) preserves interlacing for <. 

f < g xg + af < |3g (op) preserves interlacing for <. 

We can combine these matrices using the following lemma: 

Lemma 3.53. Suppose that u, v, w are vectors of length two. If{^) and ( ^ ) preserve 
interlacing, then for any positive a, (3 the matrix ( av +pw ) preserves interlacing. If 
one of them weakly preserves interlacing, then the conclusion still holds. 

Proof. Let f <— g, $ = (*), (^) 4> = (\) and (^)4> = {{). Then 
( av+pw ) 4> = ( as+pt )• Since r < — s and r < — t and at least one of these 
is strict, the result follows by adding interlacings. □ 

Corollary 3.54. Suppose that a, |3 are positive. 

2- (op) preserves interlacing. 
2. ( £ ° ) preserves interlacing. 

3- ( o ~x" ) preserves interlacing. 

4. ( p ~ x a ) preserves interlacing. 

Proof. The two matrices on the left satisfy the conditions of the lemma, so the 
matrix on the right preserves interlacing: 

Ug)&(83) ^(op) 
(52) & (£8) =>(i2) 
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In the next one, the second matrix represents the interlacing exg — ccq <C q, 
which follows from exg ^ xg < g. We then take the limit as e — > + . 



The condition that a matrix preserves interlacing is not much more than 
invertibility. Note that the signs of the leading coefficients are not relevant. 

Lemma 3.55. If the two by two matrix M. is invertible for all x, and there are poly- 
nomials r < s and u < v such that M. ( r s ) — { y ) then M preserves interlacing for 
<. 

Proof. Assume that there is an a such that u(oc) = v(<x) = 0. Since M evaluated 
at a is invertible, it follows that r(ct) = s[oc) — 0. Since r and s have no roots 
in common, it follows that u and v also have no roots in common. Thus, 
whatever the order the zeros of u and v, it is the same order for all u and v. 
Since there is one example where they interlace, they always interlace. □ 

Proposition 3.56. Suppose that M. = (/ 9 ) is a matrix of polynomials with positive 
leading coefficients. M preserves interlacing if and only if 

1. f — ► g, h — > k, and both interlacings are strict. 

2. The determinant | f 9 | is never zero. 

Proof. Assume that M preserves interlacing, and let M ( r s ) — ( ^ ) . If r <c s 
then fr + gs e P. It follows from Lemma [1.381 that g^f. If g and f had 
a common factor, then u and v would have a common factor. Thus g <C f • 
Similarly, k <c h. Suppose that p is a root of |M|. It follows that ^(p)) = (o) 
which contradicts the hypothesis that u and v have no roots in common. 

Conversely, by Leibnitz (Lemma |1.26l l both u = fr + gs and v = hr + ks 
have all real roots. By the preceding lemma, it suffices to show that there is 
one example where the roots of u and v interlace. For this purpose, choose 
r = g and s = f and apply Lemma [1.321 

The cases with < are similar and omitted. □ 

Corollary 3.57. Iff <g and the absolute value of the leading coefficient ofi is greater 
than the absolute value of the leading coefficient ofg then ( ~ f 9 j preserves interlac- 





The next one follows from previous ones: 




□ 




ingfor <. 
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Proof. Since f < g the determinant is always positive. The following interlaces 
by Lemma [1.181 and so proves the corollary: 



f -fl f 
9 f 9 



f 2 -g : 
2fg 



□ 



If we pick f -c g then 

2x x 



+ 1 0. Consequently, ( g 9 , f f , J pre 



Example 3.58. 

serves interlacing. For instance, ( 2 ^ 1? ) weakly preserves interlacing. 

Example 3.59. Ifwe expand (x+i) n we can write it as f n (x) + g n (x)i where f n 
and g n have all real coefficients. Multiplication by x + x leads to the recurrence 



fn+l 



By Corollary 13.541 it follows that f n < g n . This is an elementary fact about 
polynomials whose roots all have negative imaginary part. See Chapter [201 

Example 3.60. Here is another example where matrix multiplication natu- 
rally arises (see Theorem 17.641 . Suppose that f (x) is a polynomial. Write 
f(x) = f e (x 2 ) + xf D (x 2 ). The even part of f is f e (x), and the odd part is f M- 
For example, consider powers of x + 1: 





(x + 1) 3 


(x + 1) 4 


(x + 1) 5 


even part 


3x + l 


x 2 + 6x + 1 


5x 2 + lOx + 1 


odd part 


x + 3 


4x + 4 


x 2 + lOx + 5 



It's clear from the definition that if g(x) = (x + a) f (x) then 



g(x) = af e (x 2 ) +x 2 f D (x 2 ) +xf e (x 2 ) + axf D (x 2 ) 
g e (x) = af e (x) +xf D (x) 
QoM = feM + af (x) 

which we can express as the matrix product 

a x\ /f e \ / g e 
1 a) \i ) \q 

We will see in Example 13.741 that if h < — k have positive leading coeffi- 
cients, and h, k lie in P pos then 
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implies that r < — s. However, the matrix does not preserve arbitrary pairs of 
interlacing polynomials: 

(\ x\ f[x + 2)(x - 2)\ _ (-4 - 3 x + 3 x 2 + x 3 \ 
\1 l) \{x-l)[x + 3)J ~ \ -7 + 2x + 2x 2 ) 

The latter two polynomials are in P, but they don't interlace. However, it is 
worth noticing that 

(I x\ 2 /(x + 2)(x-2)\ _ /-4-10x + 5x 2 + 3x 3 \ 
[l l) \{x-l){x + 3)J ~ \ -ll-x + 5x 2 + x 3 J 

and the latter two polynomials do interlace. This is a general fact: 

Lemma 3.61. If a > then the matrix ( f « ) 2 weakly preserves interlacing. 

Proof. First of all, ( f * ) 2 = ( x 2 ^ 2 ) . We notice that 

fx + a 2 2ax \ (A _ f{x + oc 2 )r + 2ocxs\ _ (u\ 
\ 2a x+ot 2 ) \s) ~ \2oa+ [x + oc 2 )s ) ~ \v) 

and both u and v are in P. 

If we follow the proof of Proposition EH then 2 «* 2 = \ f £ | 2 = 

(x — a) 2 has a zero at x = a, and is otherwise positive. As long as u and v 
do not have a for a zero then the argument of the proposition applies. This is 
the case if and only if r(a) + s(a) = 0. If we choose e so that s e (x) = s(x + e) 
satisfies r^;s e , r(ct) + s e (oc) ^ 0, then we can apply the argument of the 
proposition. Taking limits finishes the proof. 

□ 

Polynomials of equal degree 

For the rest of this section we will always assume that 

deg(f) = deg(g) = deg(h) = deg(k) 



h k)(s) (v 

Lemma r3.66| in the next section shows that in order to get non-trivial results 
we must assume that f , g, h, k, r, s all have positive leading coefficients. We 
consider two different ways that interlacing can be preserved. 

(<C) r<s implies u -c v 
(<) r<s implies u<v 



These conditions put constraints on f , g, H, k. 
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Lemma 3.62. Suppose (! 2) (I) = (v)' an & consider the two interlacings 

f + ag <h + akfor a > (3.8.1) 
xf + ag < xh + akfor a > (3.8.2) 

t<s implies u v ==> (|3.8. 1|) 
r <s implies u<v => (13.8.2b 

Proof. These cases are all similar. When ( <c ) holds then we take r — x and 
s = ax. When (<) holds then we set r = x 2 and s = ax. Since s must have a 
positive leading coefficient we require that a is positive. The results follow by 
computing u, v and dividing the interlacing u < — v by x. □ 

If (13.8.11 holds then we see from considering the left side that f and g have 
a common interlacing. If both d3.8.1|) and l|3.8.2|l then Lemma l2.13l shows that 
we have true interlacing. 

Remark 3.63. If we only require that f < s implies that u <Jv and do not require 
that the degrees of f , g, h, k are equal then we do not get | |3.8.2|| , and so have 
more possibilities. In particular note that if r < s and ( * ~ b ) ( r s ) — ( " ) then 
u<cv. However, then entries of ( * ~ b ) do not satisfy I l3.8.2t . 

When | |3.8. 1\ and 1 13.8. 2\ hold we can combine the restrictions into one poly- 
nomial. 

Lemma 3.64. If J3.8.1| > and d3.8.2| > hold then for any y, z g R we have that 

g + zf + uk + uzhe P 

Proof. For any positive a and any y £ R we know that 

g + af + u(k+ ah) £ P 
g + axf + y (k + axh) £ P 

Rewriting these yields 

(g+uk) + a(f + yh) £ P 
(g+uk) + ax(f + yh) £ P 

Since g + yk £ P and f + yh £ P we apply Lemma 12.131 to conclude that for 

any z £ R 

(g+uk) +z(f + yh) £ P 

□ 

We will study such polynomials in Chapter [HI 
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3.9 Matrices of polynomials that preserve interlacing 

We investigate the set S of all matrices M with polynomial entries such that if 
v is a vector of mutually interlacing polynomials then so is Mv. § is a semi- 
group since it's closed under multiplication, and S contains all totally positive 
matrices. In this section we give a few general constructions. 

We do not have a complete analog to Lemma l3.51 but just a necessary con- 
dition. Notice that the order of interlacing is reversed. 

Corollary 3.65. Suppose fi,...,f n is a sequence of polynomials of the same de- 
gree and all positive leading coefficients. If for all mutually interlacing polynomials 
gi <|C • • • g n with positive leading coefficients the polynomial figi + ■ • • + i n 9n 
has all real roots, then the ft are mutually interlacing, and 

f n < • • • <f 2 <fi 

Proof. By taking limits we may assume that all but gt and g,, i < j, are zero. 
Now apply Lemma [l.38l to find that g, <C gt. □ 

We have seen an example of this reverse order in Lemma 11.191 Note that 
we assumed that all the polynomials had positive leading coefficients. This is 
not a convenience: 

Lemma 3.66. Suppose that f<g have positive leading coefficients. If for every r<^s 
we have that f s + gr e P, then r and s have leading coefficients of the same sign. 

Proof. Choose r — f and s = — g'. From Lemma [1.55| we know that f s + gr = 
f/ g/ is never zero, and so has no real roots. □ 

We do not have a satisfactory answer to the problem of when mutual in- 
terlacing is preserved. We will reconsider this question in § !24l24.6l where we 
find relationships with polynomials in three variables. Here are some neces- 
sary conditions; they are easily seen not to be sufficient. 

Lemma 3.67. Suppose that A — (py ) is a m by n matrix of polynomials of the same 
degree with the property that for all vectors f = (fi, . . . , f n ) of mutually interlacing 
polynomials, the polynomials Af = (gi, . . . , g m ) are mutually interlacing. Then 

• The polynomials in any column of A are mutually interlacing. 

Pi,j <P2,j < • • • <Pn,j 

• The polynomials in any row of A are mutually interlacing in the reverse order: 

Pj,l>Pj,2> ••• >Pj,n 

Proof. By continuity, we can apply A to the vector (f, 0, . . .,0). This implies 
that all columns are mutually interlacing. The interlacing in the rows follows 
from Corollary |3.65l □ 
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The next lemma determines some necessary conditions that depend on the 
coefficients. 

Lemma 3.68. Suppose that M is a matrix of polynomials of degree n and positive 
leading coefficients that preserves mutually interlacing polynomials. The matrices of 
constant terms and of leading coefficients ofM are both totally positives- 
Proof. It suffices to prove this for two by two matrices. So, assume that 
M= ( 

\ U ) where ft has constant term d and leading coefficient dt. Let M T 
be the result of substituting rx for xinM. If r is positive then M r preserves in- 
terlacing. Consequently, lim r ^o M T = ( c\ cl ) preserves mutually interlacing 

polynomials. Similarly, lim r ^oo r~ n M T = f ^ preserves mutually inter- 
lacing polynomials. The conclusion now follows from Theorem l3.12l □ 

We can use the Leibnitz lemma (Lemma 11.261 1 to get a some classes of el- 
ementary matrices that are analogous to the Jacobi matrices 1 13.1. It . Choose 
g f with positive leading coefficients. Assume that r <c s and compute 

(S?)m = ( fr f + r). 

Lemma ll ,26| implies that fr+ gs <C f s, so ( Q ® ) preserves interlacing. Similarly, 
the matrix ( ® jj J also preserves interlacing. We can generalize this construc- 
tion. 

The first elementary matrix Et has f on its diagonal, g in position i, i + 1 
for some i, and the remaining entries are 0. The second elementary matrix 
E? is not the transpose, but has g on its diagonal, f in position i + l,i, and 
the remaining entries are 0. The last elementary matrix is the diagonal ma- 
trix f I. The previous paragraph shows that these matrices preserve vectors of 
mutually interlacing polynomials. For example with n = 4 and i = 2 these 
elementary matrices are 





































°\ 


E 2 = 






f 




9 
f 






t 2 — 






9 
f 



9 






f I = 






f 






f 




























9) 




v° 








V 



With these matrices we can construct examples of transformations that 
preserve mutually interlacing sequences. For instance, if n is 4 then 



E 1 E 2 E 3 )(E 1 E,]E 



/f 6 3f 5 g 3f 4 g 2 f 3 g 3 \ 

f 6 2f 5 g f 4 g 2 

f 6 f 5 g 

\0 f 6 J 
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and consequently if hi <C h 2 «h 3 <C h 4 is mutually interlacing then (after fac- 
toring out f 3 ) the following are mutually interlacing. 

f 3 hj + 3 f 2 gh 2 + 3 f g 2 h 3 + g 3 h 4 < f 3 h 2 + 2 f 2 gh 3 + f g 2 h 4 

<f 3 h 3 + f 2 gh 4 



In general, 



(E 1 ..-E T KE 1 ...E r _ 1 )...(E 2 E 1 )E 1 = g'" l f 



: Cf 3 h4 



d + l-i 



and consequently the following sequence of polynomials is mutually interlac- 
ing 



L9 J " lfd+lH ( d }^ i ) Hl 



i = l,...,d 



Example 3.69. By taking coefficients of polynomials in P 3 we can construct 
matrices of polynomials such that all rows and columns are interlacing, and 
all two by two determinants are positive. Here's an example of a 3 x 3 matrix 
of linear terms 

/ 86x + l 625x+76 1210x + 260\ 

876x + 61 3953x + 666 4707x + 1351 
\1830x + 204 5208x+1128 3216x+1278/ 

and one with quadratic terms 



644x 2 + 114x+l 9344x 2 + 3019x + 81 30483x 2 + 13579x + 466 \ 

7816x 2 + 2102x + 78 52036x 2 + 24710x + 1206 96288x 2 + 73799x + 4818 
19820x 2 + 7918x + 417 72132x 2 + 58752x + 4114 59610x 2 + 110860x + 11777/ 

3.10 Matrices preserving interlacing in P pos 

The class of matrices preserving interlacing in P pos contains more than the 
totally positive matrices. For example, if f < g in P pos , then xg <— f. The 
corresponding matrix ( 5 J ) preserves interlacing in P pos , but not in P. In this 
section we will restrict ourselves to the following matrices: 

An NX matrix is a matrix whose entries are either non-negative 
constants, or positive multiples of x. 

We begin by determining which NX vectors map to P pos . 

Lemma 3.70. v is an NX Ibyn matrix that maps all mutually interlacing sequences 
of polynomials in P pos to P v ° s if and only if all multiples ofx in v occur to the right 
of all the positive constants. 
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Proof. If a multiple of x is to the left of a positive constant, then we may ignore 
all other coordinates, and just take v = (x, 1). In this case 

f = X g = x + 1 f < g xf+g=x 2 +x + l^P 

Conversely, assume that v = (ai,...,a T ,xa T +i, ...,xa n ), and let F = 
(f i, . . . , f n ) be mutually interlacing. Define 

r n. 

gi = ^2 a k f k g 2 = Y_ a ^ f k 

k=l k=r+l 

Note that gi <C g2 since the ft are mutually interlacing. Consequently, xg 2 < gi 
since all ft are in P pos , and sovF = g 1 +xg 2 € P pos . □ 

Next we look at two by two matrices in NX. 

Lemma 3.71. The two by two matrices listed below preserve mutually interlacing 
polynomials in P v ° s if they satisfy 

determinant is ^ 
determinant is ^ 
no restriction 

Proof. The • stands for a non-negative constant, and X stands for a positive 
multiple of x. Since two by two matrices with positive determinant preserve 
interlacing, the first two matrices preserve interlacing. The third and fourth 
cases follow from the factorizations 

/ ax bx\ _/0x\/c d\ fa cx\ _ / c a\ f0 x\ 

d)~\l 0j\a b) \b dx) ~ \d b) \1 Oj 

The last two cases are easy and omitted. □ 

Proposition 3.72. An NX matrix preserves mutually interlacing sequences of poly- 
nomials in P vos if and only if 

1. All two by two submatrices satisfy the conditions ofLemma \3.71\ 

2. All elements that lie above or to the right of a multiple ofx are also multiples of 
x. 

Proof. It suffices to take two rows, and show that they determine interlacing 
polynomials. Choose mutually interlacing polynomials (fi, . . . , f n ). Let a r 
be the rightmost nonnegative constant in the first row, and q s the rightmost 
nonnegative constant in the second row. By the second hypothesis we know 
that r ^ s. We denote the two rows by 

(ai, a. z ,..., a T , x a r+ i, ...,xa n ) (b 1; b 2 , . . . , b S; x b s+1 , . . . , x b n ) 



1: CD 

3: (5J) 
5: C?) 



2: ( X A) 
4: (Ix) 
6: ( X A) 
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Define 

T S TL 

gi = Y Q i f i 92 = ^ Qifi 93 = X. Qifi 

i=l i=T+l i=s+l 

r s n 

i=l i=r+l i=s+l 

We need to show that 

gi + x g 2 + x g 3 < hi + h 2 + x h 3 

The interlacings hi<Cg 2 , gi^h 2/ g 2 <g3, g2<h 3 follow from 
the mutual interlacing of the fi's. The interlacing h 2 <Cg 2 follows from 
the hypothesis that ( * * ) has negative determinant. We now show that 
gi + x g 3 <§; hi + x h 3 . Note that 

F = (xf s +i/...,xf n ,fi,... / f T ) 

is a mutually interlacing sequence. The assumptions on the two by two ma- 
trices imply that 



M = 



a s+ i ... a n ai ... a r 
b s+ i ... b n bi ... b T 



is TP 2 . Since MF = (gi + x g 3; hi + xh 3 ), we find 

gi +xg 3 <hi +xh 3 
xg 2 <hi +xh 3 
gi +xg 2 +xg 3 <hi +xh 3 
gi +xg 2 + xg 3 <h 2 

Adding these last two yields the conclusion. 

Conversely, we know from Lemma [3770] that all the x's occur to the right of 
the constants. If (1) is satisfied, and (2) is not, then r < s. It follows from the 
first part that the interlacing is in the wrong direction. 

Suppose that a two by two matrix listed in Lemma 13.711 does not satisfy 
the condition. In the first three cases we know the interlacing is reversed. In 
the fourth case we take the matrix to be (J d x x ) and assume that < c < d. 
Applying this to X;C x yields x(x + 1) ^> x(c + dx) since < c < d. □ 

Example 3.73. The matrix below preserves mutually interlacing sequences in 
P pos since all the determinants are zero, and the x's satisfy the second condi- 
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tion. 



(I x x 

1 1 X 

111 



1 1 
\1 1 



x x\ 

X X 
X X 



X 

V 



Example 3.74. Suppose that M is a totally positive matrix, and we form a new 
matrix N by replacing the upper right corner element by x. All the hypothesis 
are met, so N preserves mutual interlacing in P pos . 

Example 3.75. We can construct matrices with more complicated polynomials 
by multiplying these examples together. For instance, if 



mi 




m.2 




then the product mi m.2 mi below preserves mutually interlacing polynomials 
inP pos : 

6 x + 2 x 2 3 x + 5 x 2 ^ 
l + 6x + x 2 5x + 3x 2 
4 + 4x l + 7x 

Example 3.76. Here's an example where the matrix is not immediately 
obvious 11701 Theorem 2.8]. Suppose t is a positive integer and that 
f o (x), . . . , f T _i (x) are mutually interlacing. If 




(1 + x + ■ • • + x r - : ) (f (x r ) + xfi(x r ) + 



got* 



+ x r - 1 f r _i(x r )) 
)+xgi(x T ) + ---- 



-ilx' 



then go(x), . . . , g r _i(x) are also mutually interlacing. This follows from the 
observation that 



(1 x x 
1 1 x 
111 



A 

X 
X 



1 1 
\1 1 



x 

V 



/ fo(x) \ 
fi(x) 
f 2 (x) 

Tr-2(X 
Vfr-l(x)/ 



/ 9o(x) \ 
91 (x) 
92M 



9r-2(x 



Example 3.77. Here's a similar example. Choose a positive integer r, and 
suppose that g(x) = (x + oc)f(x). Write 



T-l 



r-l 



f(x) ^xW 



g(x) ^^x^gifx 1 -) 



k=0 



k=0 
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If {fo, . . . , f r -i} are mutually interlacing then so are {go, . . . , g r -i}- The proof is 
the observation that 



fa 





. 


■ x \ 


1 


a 


. 


. 





1 


a . 


. 


V° 





. 


. a) 



( foM \ 

f T _ 2 (x 



/ goM \ 

9i M 



9r-2(X 



This example can be used to give another proof of Theorem l7.65l 



3.11 Linear 2 by 2 matrices preserving interlacing 

We are interested in determining when the matrix 

/a a x + bi a 2 x + b 2 \ 
\^a3X+D3 a 4 x + b 4 y 

preserves interlacing in P pos . We give a nice construction due to Liu and Wang 
11171 . There are constructions for matrices of higher degree that preserve in- 
terlacing (See p. 16931 ). 

We begin by specializing to the matrix M = ( X ^ Q x * b ) . If we apply M to 
the vector (x, ax) where a > then the result is a pair of interlacing polyno- 
mials. After factoring out x 

x+a + xtx<Cl + (b + x)q 
-a/(l + a) ^ (-1 -ba)/a 
P = a 2 b + a(b + 1 - a) + 1 > for all a ^ 

Next, apply M to (x 2 , ax). Factoring out x yields 

x(x + a + a) < x(l + a) + bet 



Substituting the root of the second term into the first gives 

— ba , — ba 

-r— Lt— + a + a) <0 
1 + a 1 + a 

R = a 2 + (a + 1 - b)a+ a ^ for all a ^ 

Note that R and P have the same discriminant Q = (a — b) 2 — 2 ( a + b ) + 1 . The 
graph of Q = is a parabola, and is given in Figure [3731 

Lemma 3.78. ( x ^ a x x b ) preserves interlacing in P v ° s if and only (a, b) is either in 
the parabola, or in the finite region bounded by the axes and parabola. See Figure |373l 
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Proof. Q intersects the axes at (1,0) and (0,1) and is negative in the region 
inside the parabola, so P and R are non-negative for those (a, b). If (a, b) is in 
the unbounded region below the parabola, then Q is positive, and b + 1 — a 
is negative. Thus, P has two positive roots, and so is not positive for positive 
a. R has two positive roots in the region above the parabola. Thus, if M 
preserves interlacing then ( a, b ) can't be in either unbounded region outside 
the parabola. 

Lemma r3.79l shows that all points in the gray region of Figure l3~3l determine 
matrices preserving interlacing. We can use Example 111.611 to show that all 
( a, b ) in the parabola determine interlacing preserving matrices. □ 




Figure 3.3: The graph of (a - b) 2 - 2(a + b) + 1 = 



Lemma 3.79. If a and b are positive, and \a — b ^ 1 then ( x | a x * b ) preserves 
interlacing in P v ° s . 

Proof. We first claim that ( x ^ x * t ) preserves interlacing for positive t. If we 
write 

'x + t x \/f\ /(x + t)f + xg N 



1 x + tj \gj \f + (x + t)g 

then each of the two terms on the bottom interlaces each of the two terms of 
the top. Similarly, gff +xq) -Cf + (x+t)g, so ( f ^ t ) preserves interlacing for 
any positive a. 

Since ( " £+ t ) and ( x * t ) have the same second row, the matrix 

( x+t+ct (cc+l)x 



x+ i±a x 



i x+tj preserves interlacing. Multiplying the first row and the sec- 
ond column by l/(a + 1), and replacing x by (a + l)x shows that 

(3.11.2) 



L 



preserves interlacing. Suppose that b + 1 ^ a ^ b. If we choose t = b/(b + 
1 — a) and a = (a — b)/(b + l — a) then t and a are both non-negative, and 
1 13.1 1.21 reduces to ( x \ a x x b ) . The case a+l^b^ais similar. □ 

We have seen that the criterion for a 2 x 2 matrix of polynomials to preserve 
interlacing is that the determinant is always positive. If we restrict ourselves 
to P pos then we only need that the determinant is positive for negative x. The 
following result is due to Liu and Wang lll7l . 
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x+a 
1 



Lemma 3.80. Let M = (** £ J where c > 0, £i,« 2 G P pos (l) and q e P pos (2), 

rotere l\,l2, q fezye positive leading coefficients. If |M| ^ 0/or all x ^ then M 
preserves interlacing in p pos . 

Proof. Let M(f, g) 1 = (F, G)* where f < g in P pos . The root a of l\ is negative, 
and since the determinant is non-negative at a, it follows that q(a) < and 
so q < l\. From Leibnitz (p.O we know that F and G are in P pos . 

Since G has positive coefficients and negative roots, |M| is non-negative at 
the roots of G. In addition, we have G = cf + £ 2 g < g- The identity 

cF = h G - |M| g 

shows that F sign interlaces G . Since the degree of F is greater than the degree 
of G we conclude that F < G . □ 

If q is linear then we get 

Corollary 3.81. Let M = ( ^ ^ ) where c> and l x , t 2 , l 3 e P pos (1) have positive 
leading coefficients. If\M\ ^ 0/or all x < then M preserves interlacing in pv° s . 

Note that the corollary provides another proof of Lemma 13.781 since 
is (x + a)(x + b) — x, and this determines the same region that 
is described in the proposition. 
Here is a simple consequence. 

Corollary 3.82. ( * X ^ Y ) and ( X + Y x ) preserve interlacing in P pos if and only if 
y 1 . 

Proof. Existence follows from the lemma. If -y > 1 then (0,y) and (y, 0) lie 
outside the parabola. □ 

Lemma 3.83. Suppose that a, b, c, d are positive, and I " d ^ 0- 
2. ( £ x ^ b ) preserves interlacing in P v ° s . 

2. ( £ ) preserves interlacing P v ° s if and only if c > a, d ^ b. 

3. ( x + a x ^ b ) preserves interlacing P v ° s if and only if a ^ b, c ^ d. 

4. ( x +£ x +^ ) preserves interlacing p pos zjand onZy if 
c ^ a ^ b, fl«d (c — a)(a — b) ^ ad — be ^ 0. 

Proof. The first one follows from the decomposition + (c §)■ K c ^ a 

and d b then 

a x + b\ /l 0\ / a x + b 
c x+dj ~ ll 1/ [c-a d-b 
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shows that ( £ ) preserves interlacing. Conversely, if interlacing is pre- 
served, then applying ( " ) to (1, a) yields 

a + a(x + b) <|C c + a(x + d) 

Taking roots shows that ocb + a ^ ad + c for all positive a which implies c ^ a 
and d ^ b. 

The third one is similar: sufficiency follows from 

fx+a x + b\_/a-b x + b\/l 1\ 
^ c d J ~ \c-d dJ^OlJ 

and necessity is a consequence of applying ( x + a x ^ b ) to (x, a). For the last 
one, we first compute 

f\ 0\ fx + a x Wl \ fa b\ fx + a x + b\ 
[l l)\c-a ^E^J \P a-bj^0 lj~l v x + c x + dj 

By Corollary EH we see that ( ) preserves interlacing if and only if 

( c-q ad -* c 1 does. From (3) this is the case if and only if c — a ^ a ^~^ c . □ 



3.12 Polynomial sequences from matrices 

A totally positive or NX n x n matrix determines sequences of polynomials 
obeying an n'th order recurrence relation. We first recall how to get sequences 
satisfying a recurrence relation from a matrix. Suppose that M is a matrix, v is 
a vector, and the characteristic polynomial of M is 

q + aiy + a 2 y 2 H h a n y n . 

Since M satisfies its characteristic polynomial, we have 

a (v) + qi(Mv) + q 2 (M 2 v) + • • • + a n (M n v) = 0. 

If we fix an integer k and define Wt to be the kth coordinate of M x v then the 
wt also satisfy the same recurrence relation 

no wo + ai wi + ci2 W2 + • • • + a n w n = 0. (3.12.1) 

We now let M be a totally positive matrix, and let v be a matrix of mutually 
interlacing polynomials. Since JVPv = Vv is a vector of mutually interlacing 
polynomials, all the coordinates of Vi are are in P. These polynomials satisfy 
the recurrence relation d3.12.1|) . All these polynomials have the same degree 
but consecutive vt do not necessarily interlace. 

Example 3.84. Consider an example where M = ( j 3 3 J and we choose vo to 
be three mutually interlacing polynomials of degree 2. 
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v =( (l + x)(4 + x) (2 + x)(5 + x) (3 + x)(6 + x) ) 

vi =( 2(21 + 14x + 2x 2 ) 88 + 53x+7x 2 152 + 87x + llx 2 ) 

v 2 =( 370 + 221x + 29x 2 2(381 + 224x + 29x 2 ) 2(653 + 381x + 49x 2 ) ) 

If we just consider the second column, and observe that the characteristic 
polynomial is 1 — 12y + 10y 2 — y 3 , then we conclude that all polynomials 
defined by the sequence below are in P pos . 



p n+3 = 10pn+2 - 12p n +l + Pn 

p = (2+x)(5 + x) 

pi = 88 + 53x + 7x 2 

p 2 = 2(381 +224x + 29x 2 ) 

We now consider NX matrices. These are more interesting, since the de- 
grees of the polynomials can increase. 

Example 3.85. We take the matrix M = ( \\ ) . If we begin with v = (x + 1, x + 
2) then the second column yields that p n e P pos , where 



Pn+2 = 2p n+1 + (X-I)p n 
P0 = X + 2 

pi = 2x + 3 



/ 1 X X \ 

Example 3.86. We now take the matrix M=(iix). If we begin with v 
(x + 1, x + 2, x + 3) then the third column yields that all p n e P pos , where 



Pn+3 = 3p n +2 + 3(x - l)p n +l + (x - l) 2 p n 
Po = X + 3 
Pi = 3x + 6 
p 2 = 3x 2 + 14x + 10 

Example 3.87. In this example we let M be the cube of the previous M. 

/l + 7x + x 2 6x + 3x 2 3x + 6x 2 \ 
M= 3 + 6x l + 7x + x 2 6x + 3x 2 
\ 6 + 3x 3 + 6x l + 7x + x 2 / 

Since every row has a quadratic term, the degree of the p n 's goes up by two 
at each step, so consecutive p n 's don't interlace. If we again start with v = 
(x + l,x + 2,x + 3) then all p n areinP pos : 
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p n+3 = 3(1 + 7x + X 2 ) Pn+2 - 3(X - l) 4 Pn+l + (x - l) 6 p n 

p = x + 3 

pi = 5 + 42x + 30x 2 + 4x 3 

p 2 = 8 + 260x + 947x 2 + 784x 3 + 181x 4 + 7x 5 

3.13 Interlacing via the complexes 

In this section we show how we can use properties of complex numbers to 
establish interlacing. Let UHP be the open upper half plane {z | 3{z) > 0}. 
Note that 

f£P» f (a) ^ for all a e UHP. 
Our first result connects interlacing and quotients. 

Lemma 3.88. Suppose that f, g are relatively prime and have positive leading coeffi- 
cients. The following are equivalent 

2. f <— g 

2. i : UHP — ► UHP 

g 

Proof. Suppose 1) holds. If we write 

g = aof + Y_ en — - — 

z — X — T| 

where roots (f) = (n) then 

9 , s \— 1 

- a = a + > at 

f z — a — T| 

If ae UHP then a-ne UHP, so l/(a— n) e -UHP, and therefore (g/f) (a) e 
-UHP. Taking inverses shows that (f/g)(a) e UHP. 

Conversely, if 2) holds and f + tg ^ P for some tel then there is a e UHP 
such that f (a) + tg(a) =0. Thus, (f/g)(a) ^ UHP. Thus f and g interlace, and 
the previous paragraph shows that the direction must be f < — g. □ 

The upper half plane is a cone - it is closed under positive sums. This 
additivity explains two fundamental results about interlacing. 

Lemma 3.89. Suppose that f, g, h, k in P have positive leading coefficients. 

1. Iff<g,hthenf<g + h. 

2. Iff < g, h^kthen fh<fk+ gh<gk. 
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Proof. If f < g,h and ff is in UHP, then ^(ff) € -UHP since f (cr) and £(ff) 
are, and -UHP is a cone. It follows that f/(g + h)(ff) € UHP. 
The second one is similar; it's just the addition of fractions: 



f ff) + -(ff) = M (ff). 
f h fh 



□ 



Note that the region C \ (— oo,0) is not a cone, so we should not expect 
f ~ g, H to imply g + h e P. Indeed, it's false. 

All points in the upper half plane can be realized by ratios of interlacing 
polynomials. 

Lemma 3.90. If a and cr are in UHP then there arei < g in P with positive leading 
coefficients such that = a. 

Proof. We will show that the set 



f(ff) 

g(ff) 



f<g 



equals UHP, considered as a subset of complex projective space. We know all 
elements of S lie in UHP. Choose anyf < g and let f (ff)/g(ff) = |3. Since SL^M) 
is transitive on UHP we can find M = ( £ Jj) in SL 2 (K) such that M (P) = a - 
Then 

af + b ^ 'a) = M ( f V) = M (P) = a 



v cf + dgj ^g y 
Since M has determinant one we know that af + bg < cf + dg. □ 

Matrices preserving interlacing are the same as matrices mapping UHP 
to UHP. The next lemma provides an alternative proof of Corollary 13.81 and 
clarifies why there is a simple determinant condition. Recall that if M has all 
real coefficients then M maps UHP to itself if and only if |M| > 0. 

Lemma 3.91. Suppose that Mis a 2 by 2 matrix of polynomials with positive leading 
coefficients. The following are equivalent 

1. M preserves interlacing. 

2. M(cr) maps UHP to UHP for all o e UHP. 

Proof. Assume that M preserves interlacing, and choose any cr and a in UHP. 
By Lemma 15. 90l we can find f < g such that f (cr)/g(cr) = a. Then 

[M(a)] (a) = [M(cr)] ( ) = [M ( I )] (ff) e UHP 

since M ( g ) is a pair of interlacing polynomials. 

Conversely, if f < g and M. ( g ) = ( q ) then for all ff £ UHP we know 

( Iw ) e UHP - Thus ( GM ) e UHP ' which impiies that F < G. □ 
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Figure 3.4: Mutually interlacing polynomials evaluated at cr 



3.14 Mutually interlacing polynomials 

We will generalize the following three equivalent statements that hold for a 
real 2 by 2 matrix M to mutually interlacing polynomials and strictly totally 
positive matrices2- 

1. M has positive determinant. 

2. M maps the upper half plane to itself. 

3. M preserves interlacing polynomials. 

First of all, mutually interlacing polynomials have a pretty geometric inter- 
pretation. Suppose that fi, . . .,f n are mutually interlacing. For any cr 6 UHP 
we know that f i ( cr)/f j ( cr) 6 UHP for i < ). In terms of angles this means that 
the angle from f j (cr) to fi(cr) is less than 7t. It follows that 

All the points {fi(cr)} lie in a half plane, and are , in counterclock- 
wise order, fi(cr), . . . , f n (cr). (Figure l3~4t 

A 2 by 2 matrix with real entries preserves the upper half plane iff it pre- 
serves interlacing. We know that an n by n matrix preserves interlacing if and 
only if it is totally positive2- We now introduce the structure preserved by 
strictly totally positive2 matrices. 



={(ui,...,u d 

For instance, 
H 



u 



- E UHP fori <i< j ^ d}. 



= {(u,v,w) I - e UHP, - g UHP, — e UHP} 

V w w 
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The space is a subset of P 2 (C), and is usually identified with UHP. A 
particular point in is u = (C d_1 , 1) where C = e nl/d . The character- 
ization of mutual interlacing is the same as for interlacing, and follows easily 
from the case d — 2. 

Lemma 3.92. Assume (f l7 . . . ,f d ) is a sequence of polynomials. The following are 
equivalent: 

1. (fi, . . . , f d) is strictly mutually interlacing. 

2. (fi(<T),...,fd(ff)) e H+forallve UHP. 

Let Td be the semigroup of all invertible matrices with non-negative en- 
tries for which the determinants of all two by two submatrices are positive. 
For example, T 2 consists of all two by two matrices with non-negative entries 
and positive determinant. 

Lemma 3.93. Td acts transitively on Hj. 

Proof. We know that T 2 acts transitively on UHP. Choose two elements 
(ui, . . .,Ud) and (vj, . . ., v d ) of Hj. The block matrix 




where M is a 2 by 2 matrix in T 2 preserves H J . Since T 2 is transitive we first 
carry Ui,u 2 to Vi,v 2 , then leaving the first coordinate fixed we carry v 2 ,U3 to 
v 2 , v 3 and so on. □ 

Lemma 3.94. For any u e H J and cr e UHP there is a vector f of strictly mutually 
interlacing polynomials such that f (cr) = u. 

Proof. Choose any vector g of mutually interlacing polynomials. Since g(a) € 
there is an M e T d such that M(g(a)) = u. Thus u = (Mg)(cf), so Mg is 
the desired vector. □ 

Proposition 3.95. Suppose that Mis a d by d matrix of positive real numbers. The 
following are equivalent 

1. M preserves mutually interlacing polynomials. 

2. M is strictly totally positivei- 

3. M maps to itself. 

Proof. We have already seen the first two equivalences. Assume M maps 
to itself. For any cr e UHP and mutually interlacing fj < f 2 • • • < fd we know 
that 

[M • (fj f d )](a) = M • (fi(cr), . . .,fd(cr)) e H+ 
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since (fi(cr),. . .,f d (cr)) e Hj. Thus, M preserves interlacing, and so is totally 
positive2- 

Conversely, suppose that M is totally positive2, and u = (ui, . . . , Ud) is in 
Hj. Choose mutually interlacing f = (fx, . . .,f d) such that f (a) = u for some 
a e UHP. Then 

Mu = M(f(ff)) = (Mf)(a) e H+ 
since Mf consists of strictly totally interlacing polynomials. □ 

Remark 3.96. We give a direct proof that (2) implies (3). It suffices to take M to 
be the 2 by d matrix ( g ). Assume a = (at), b = (bt) and v = (vO- We show 
that Mv e UHP by computing the imaginary part of the quotient of Mv. 



Y_ a|V| \ _ 3 (Y. a i v i) (L b i v i) 



Ignoring the positive denominator, the imaginary part is 



Y_ Q i D il v i| 2 + Y. Q ^ b i V i V ) + Q ) b i v i v i = Y. 



b, b, 



All the two by two determinants are positive by hypothesis, and 3(v|Vj) is 
positive since vi/vj e UHP. 



Chapter 




Homogeneous Polynomials 



We begin the study of polynomials in two variables with a very special case: 
homogeneous polynomials. These results are generally well known, and will 
be generalized in later chapters. 

4.1 Introduction 

A homogeneous polynomial of degree n is a polynomial in x, y such that the 
degree of every term is n. A polynomial f of degree n in one variable deter- 
mines a homogeneous polynomial of degree n: 

F(x,y) = y n f(x/u). 

The coefficients of F are those of f . If 

f (x) = a + ajx + • ■ • + a n x n (4.1.1) 

(4.1.2) 

then the homogenized polynomial is 

F(x,y) = a y n + dixy^ 1 + • • • + a n x n (4.1.3) 

We can recover f from F since f (x) = F(x, 1). 

The key relation for us between homogeneous polynomials and polyno- 
mials in one variable is that factorization for homogeneous polynomials cor- 
responds to the property of having all real roots. 

Lemma 4.1. Let F(x,y ) be homogeneous, and set f = F(x, 1). We can factor F into 
linear factors if and only iff £ P. 

Proof. If f = F}(ajX + bt) then F = nt a i x + ^>iv) an d conversely. □ 
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The reversal of f , written f REV , is 

a n + a n _ix + • • ■ + a x rt . (4.1.4) 

We can express f REV in terms of F since f REV (x) = F(l,x). This shows that if f 
is in P then f REV is also in P. 

4.2 Polar derivatives 

Since a polynomial in x and y has derivatives with respect to both x and y 
we can construct a new derivative operator for polynomials in one variable. 
From(|4T3| we find 



9 

— F = naoy*- 1 + (n - l)a iy n - 2 x + • ■ ■ + a^" 1 . 
3D 

If we substitute y — 1, then the right hand side becomes 

ua + (n-l)x+--- + a n _ 1 x n - 1 (4.2.1) 

which can be written as 

n(a + aix H h a n x n ) — x(aj + 2a 2 x H h nx n _i) 

= nf - xf ' 

This is called the polar derivative of f and is written ^-f. The degree of ^-f 
is n — 1, and the formula for ^-f has an n in it. The polar derivative is not not 
defined on P, but only on P(n). It is useful to see that the polar derivative can 
be defined as follows: 

reverse f; differentiate; reverse 

Since these operations preserve P, the polar derivative is in P. 
If we assume f = 0( x — a t) then 

ff = 2>°0— (4-2.2) 
oy £ — x — at 

which is similar to the representation for the derivative 

A f = y f(x) 

3x ^— x — a.i 

If f G P ± then the polar derivative has all real roots 1 by Lemma 12.111 Since 
polar derivatives act like derivatives, it is not surprising that we have this 
result (due to Laguerre). In fact, in Chapter|9]we will see that they are exactly 
derivatives. 



'It fails to have all real roots if there are positive and negative roots - e.g. (x + l) 2 {x — 2) 
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Lemma 4.2. Iff e P ± and f /zas aZZ distinct roots, then f < ^jf. 77ie po/ar derivative 
maps P vos to itself and P alt to z'fseZf 

Proof. The result follows from Lemma fl .201 and l|4.2.2t . The mapping proper- 
ties follow from 114.2.11 . □ 

Corollary 4.3. Iff G P* ffren f-a^^fe P for a > 0. 

Proof. Use Lemma l4~2l and Corollary IZlOl □ 

Lemma 4.4. If f = do H + a n x n z's in P f/zezz and ^-f interlace. Iff has 

positive leading coefficient then 

• If a n _i z's negative then <C ^jf. 

• If a n _i z's positive then -§zf~> -^f- 

Proof. After dividing by a n we may assume that f is monic. Since = 
— xf + nf we see that sign interlaces f. The sign of •ggf at the largest 
root of f is negative. Since the leading coefficient of ^-f is a n -i/ the result 
follows. □ 

Polar derivatives preserve interlacing, but the direction depends on certain 
of the coefficients. 

Lemma 4.5. Suppose that f — Y_o ck* 1 and g = Y_ o ^H* 1 satisfy f<g. 

• If a n _ 2 /a n _i > b n _ 2 /b n -i then < -^g 

• If a n -2/a n -i < b^/bn-i then < 

. Iff, g e P Qlt or f, g e pv° s then « ^g 

Proof. Since maps P to P, it follows from Theorem II .431 that ^-f and ^-g 
interlace. If we compute ^-f by reversing f, differentiating, and then revers- 
ing, we see that if f has roots all the same sign then preserves interlacing. 
It remains to find the direction of interlacing. 

In general, if <C ^9/ then we can differentiate n — 2 times to find that 

u!a n _ 1 x+ (n- l)!a n _ 2 < u!b n _!X + (n- l)!b n _ 2 

If ^j-g <C •g^f then this interlacing would be reversed. This implies the result. 

□ 

We can consider the poset of all partial derivatives of a polynomial f . In 
case f is in P* then we have predictable interfacings. For instance, 
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Corollary 4.6. Iff£P pos isa polynomial of degree n, then whenever i + ) ^nwe 
have 



The poset of partial derivatives is log concave. 



d r _d a_x 

3i) 1 dx dij 1 



< 



Lemma 4.7. //f e P pos faas all distinct roots then 
Proof. From the definition we have 

f *f _ f 

ay 1 <>x <>y 1 

Since f e P pos we know f ^> xf, and the result follows from Lemma [1.551 □ 



f 




f f 


-(xf)' 




xf (xf)' 



4.3 Newton's inequalities 

We are going to derive relations among the coefficients of one or two polyno- 
mials whose roots have appropriate properties. The idea is to differentiate the 
polynomials until we get a polynomial of degree 1 or 2, and then use proper- 
ties of linear and quadratic polynomials. 

It is immediate from the quadratic formula that if that f (x) = ax 2 + bx + c 
has two distinct real roots then b 2 > 4ac. We can use the quadratic formula to 
derive inequalities for the coefficients of a polynomial that has all real roots. 



Suppose that we are given a fourth degree polynomial f 

3 



do + aiX + Q2X Z 



CI3X 3 + CI4X 4 with all distinct roots. Then, ^Lf has all distinct roots. Differentiat- 
ing once more, the second derivative 2Q2+6Q3X+I2Q4X 2 has two distinct roots, 
so we conclude that 36a 2 > 96Q2Q4, or a 2 > (4/3)a2Ci4. Newton generalized 
this argument. 

Theorem 4.8 (Newton's Inequalities). Suppose that the polynomial 
a n x n H H ao has all real roots. 



1. If the roots are all distinct then 

1,,. 1 
k 



al > (1 + 



(1 



lau-iQic+i, for k= l,...,u-l. 



n-k 

2. If the roots are not necessarily distinct, then 

a k > a k _ia k+1 /ork= l,...,n-l. 
unless a k = a k _i = or a k = a k +i = 



(4.3.1) 



(4.3.2) 



Proof. Let F be the homogeneous polynomial corresponding to f . Since par- 
tial derivatives preserve distinct real roots, if f has all distinct roots then the 
polynomial 



»k— 1 an— k— 1 



k— 1 Ai.n— k— 1 



9x k 1 dy 



( O-k-1 2 , o a k . a k+l 2 1 
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has all real roots, and they are all distinct. Consequently, 

W 2 >£hl.M (433) 

which reduces to 114.3. Q . 

If the roots of f aren't distinct, then the inequality in ||4.3. Il l is replaced by 
^. If both a^-i and a^+i are non-zero, then | |4.3. 2\ follows. If one of them is 
zero then the only way for | |4.3. 21 to fail is if ak = 0. □ 

There is a kind of converse to Newton's inequalities, see § 1515.41 If all the 
coefficients are non-zero then we can restate the conclusion in terms of the 

Newton quotients: 

^ ^Tj+lTL-IC+1 (434) 



ak-ia k+ i k n-k 

Example 4.9. We can use the Newton quotient to get inequalities for more 
widely separated coefficients. Suppose f(x) = Y_ a** 1 € P pos (n). From ||4.3. 41 
we get 

4 /k + ln-k+l\ 2 2 2 

Qv ^ — : Qv_iCl 



/k+ln-k+l\ 2 k n-k + 2 k + 2 n-k 

n— k / k— In— k+1 k+ln— k— 1 



al ^ /(k + 2)(k+l) (n-k + 2)(n-k + l) 



a k -2ak+2 y k(k-l) (n-k)(n-k-l) 
If we iterate this inequality, then for any r satisfying k ^ 2 r , n — k ^ 2 r 



4 > 2 r/ (k + l) 2T (n-k + l) 2r 
ak-2'-ak+2'- ^ Y (k) 2 r ( n-k ) 7T 

A similar argument to the above puts an interesting restriction on consec- 
utive coefficients of a polynomial with all real roots. 

Lemma 4.10. If a polynomial f has all real roots, and two consecutive coefficients of 
f are zero, then we can write f = x 2 g where g is a polynomial with all real roots. 

Proof. Write f = Y. ck* 1 / an d let F be the corresponding homogeneous poly- 
nomial. Since partial derivatives preserve roots, the polynomial 

G(X '^ 9x^-3 9u^ F 

= Cia k X 3 + C 2 Qk-lX 2 y + C 3 Q k -2XU 2 + C 4 Qk-3U 3 
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has all real roots, where C\, C2, C3, C4 are non-zero multiples of the original coef- 
ficients only depending on n and k. If consecutive coefficients cik-i = cik-2 = 
0, then G (x, 1 ) — C\ cikX 3 + C3 ak-3 . This polynomial does not have all real roots 
unless Qk-3 = or cik = 0. A similar argument shows that if we have r con- 
secutive coefficients that are 0, then if they do not include the constant term 
we can find a block of r + 1 consecutive zero coefficients. Continuing, we see 
that at the very least we must have that the constant term and the coefficient 
of x are 0. □ 



Corollary 4.11. If f (x) = ao H h a n x n has all real roots and not both ao and ai 

are zero then a? — ai+iat_i > Ofor i = 1, ...,n— 1. 

We can read Lemma r4.10l to say that if two consecutive coefficients are zero 
for f 6 P, then all earlier ones are also zero. We can slightly extend this to two 
variables. 

Lemma 4.12. Suppose f (x,y) = h g(x)u r + by T+1 + cy T+z + • • • satisfies 

f(cc,u) e P for all a e K 
then g has even degree, and cg(x) ^ Ofor all x. 

Proof. As is usual in these arguments, we differentiate r times with respect to 
y, reverse, and differentiate and reverse again until we have a quadratic. It 
has the form ag(x) + |3by +ycy 2 for positive a, (3,y. Since this has roots for all 
values of x we must have that g(x)c ^ 0, which implies g(x) must have even 
degree. □ 

This phenomenon is possible - for instance, take f = x 4 — y 2 . The corollary 
below is a two variable analog of Lemma [4. 101 

Corollary 4.13. 7/f(x,y) = Y. hMv i satisfies 

1. f(a,y) e P for all ael 

2. the degree of any ft is at most 1. 

3. Some two consecutive ft are constant, 
then all earlier f t are constant. 

Proof. Using the lemma inductively shows that all earlier coefficients are con- 
stant. □ 



4.4 More inequalities and bounds 

There are many inequalities beyond the Newton inequalities. It's known that 
the set of polynomials of fixed degree with real coefficients is a semi-algebraic 
set, and so is determined by a finite set of inequalities. Such a set is given by 
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Theorem l4.23l In this section we show how to get inequalities using majoriza- 
tion and the Arithmetic-Geometric inequality. We apply these to determining 
a bound for polynomials on an interval. 

The following well-known inequality uses the Arithmetic-Geometric in- 
equality. I don't know if it follows from Newton's inequalities. Lemma 14.151 
uses a weaker inequality that holds for all the coefficients. 

Lemma 4.14. Ifa + • • • + a n x n eP pos then 

L - a n-l ^ n a 0l n 
\ n-1 



2. ai > a n 

V na o/ 

The inequality is strict unless f = a(x + b) n for constants a, b. 
Proof. If the roots are —T\, . . . , — r n then 

= Tl • • • T n 

an 

a n -i 

= tH h r n 

a n 

Since all Ti are positive the Arithmetic-Geometric inequality implies that 



lQn-l Tl + ^ T n . , ,1/n /^ao^ 17 '' 

= ^{ri---r^ L/n 

n a n tl 

The second inequality follows from the first by reversing f . Equality holds in 
the AGI if and only if all roots are equal. □ 

We can use bounds on the coefficients to bound the size of a polynomial 
with all real roots. We begin with a simple consequence of MacLaurin's in- 
equality. The important point is that the upper bound does not depend on the 
degree of the polynomial. 



Lemma 4.15. I/f(x) = Y.o aix 1 e P pos then sup |f(x)| < |a | e 



Proof. Since f € pP° s we can apply Newton's inequalities, and so 



do ai \2a2/ \3a3/ Qk-i 



This gives a version of MacLaurin's inequality: 
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and therefore for r > 1 



X | SC T- 



k=0 



sup |f(x]| ^ r n ^ a k < a ^( — ) r n /k! < a expfraj/ao) 



□ 









Q m+1 


+ 3r 2 


Clm+2 








+ r 2 






) 



To find bounds for arbitrary polynomials in P we need a result of 111671 . 
Lemma 4.16. If f (x) = a m x m + • • • + a n x n € P and a m a n ^ then 



sup |f(x)| < |a m |r m exp( r 

x|<r 



Definition 4.17. Given two sequences a = (qi ^ <i2 ^5 • • • ^ Q n ) and b 
(bi ^ b2 < • • • ^ b n ) then we say a majorizes b, written b -< m a, if 

b n < a n 
b n -i + b n < a n _i + a n 

: < : 

b2 H h b n _i + b n < a 2 H h a n -i + a n 

bi + b 2 H h b n _i + b n = aj + a 2 H h a n _i + a n 



There is an important inequality between monomial symmetric functions 
due to Muirhead |75l[T64| . 

Theorem 4.18 (Muirhead). Given two sequences a ^ m b where 

a = (ai ^ Q2 ^ • • • ^ a n ) and b = (bj ^ b2 ^ • • • < b n ) then for all positive 

X\ f . . . , x n 

Lx ai x Q2 ■••x aTl < V x bl x bl •••x bn (4 4 1) 

cr(2) a(n) Z_ cr(l) A cr(2) cr(n) ^ ' 

o"GSym n a"£Sym n 

Example 4.19. Consider a cubic f = (x + ri)(x + T2)(x + r^) where ri,r 2 ,T3 
are positive. We also write f = Co + Cjx + C2X 2 + C3X 3 . We list the four index 
sequences that sum to 4 and the expansion (after homogenizing) of the sum 
in Muirhead's theorem in terms of the coefficients: 



Name 




expansion 


M004 

M013 
M022 

M 112 


0,0,4 
0,1,3 
0,2,2 
1,1,2 


2(c| - 4cic| c 3 + 2 c{ c\ + 4 c c 2 c§) 
-c 3 (2c 3 c 2 - c|ci + c c 2 c 3 ) 
2 (c 2 - 2c c 2 ) c 2 
2cqC 2 c 2 
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For example, 

M m = rir 2 r3 + nT§r 3 + r 2 r 2 r 3 
= rir 2 r 3 (ri +r 2 + r 3 ) 
= c c 2 

To homogenize we form — CoC 2 c 2 . There are four majorizations that 

yield (after simplifying) inequalities 

M 13 M 00 4 2C 2 - 9ClC 3 C 2 + 9c C 3 C 2 + 6C1C3 ^ 

M022 -<m M004 c| - 4cic 3 c| + 6c c 3 c 2 + c 2 c 3 Ss 

Mll2 M013 - 3C C 2 + C 2 ^ 

Mll2^mM022 - 2c 3 Cj + C 2 Ci - 3c c 2 c 3 ^0 

Although the third is Newton's inequality calculations show that the first one 
is not a consequence of the two Newton inequalities. 

4.5 The matrix of a transformation 

If T is a transformation on polynomials, then we define cp(T) to be the ma- 
trix of T in the basis {1, x, x 2 , x 3 , . . . }. We investigate the relation between the 
properties of T and the properties of cp(T). 

It is often the case that cp(T) is totally positive. However, T: P — > P does 
not imply cp(T) is totally positive, nor is the converse true. Consider the two 
examples: 

Example 4.20. Suppose that Tfx 1 ) = aix 1 , and D is the diagonal matrix whose 
i, i entry is Qt. Clearly D is totally positive, yet most transformations of this 
form do not preserve roots - see Theorem |15.22| 

Example 4.21. If T(f] = f (D)x 2 then we know that T: P — > P. However, 



(° 


1 


..A 





2 ... 




2 


... 




V 




/ 



where all the remaining entries are zero. This matrix is not totally positive. 

Although a transformation that preserves roots does not necessarily deter- 
mine a totally positive matrix, we do have a result for two by two submatrices 
if the direction of interlacing is preserved. 

Lemma 4.22. If the linear transformation T satisfies 
1 T - P v ° s > ppos 
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2. T preserves the sign of the leading coefficient. 

3. f <g implies T(f)<T(g) for f G P pos . 

then all two by two submatrices o/cp(T) are non-negative. (Thus cp(T) z's TP2.) 

Proof. It suffices to show that all two by two submatrices composed of adja- 
cent entries have non-negative determinants. Il6ll Since T(x n ) <J Tfx™ -1 ), it 
follows from Corollary 14 .3 1 1 that the determinants are non-negative. □ 

We now consider several examples where cp(T) is totally positive. If 
Mf (9) = f ■ g is the multiplication transformation, then we write <p(f) for 
(p(Mf ). One important case is when f = a + bx. 



4) (a + bx) 



/a b 

a b 

0a 





For another example, take f (x) = qq + ajx + CI2X 2 + CI3X 3 



cp(f) = 



The matrix of a derivative transformation is similar: 



/ ao 


ai 


Q 2 


a 3 











...\ 





a 


ai 


a 2 


a 3 
















a 


ai 


a 2 


a 3 
















a 


ai 


Q 2 


a 3 




{'■ 














'■•/ 



4>(f H^af + f) = 



fa 
1 a 
2 






a 
3 






a 



V 



If f (x) = qq + aix + a 2 x 2 + a 3 x 3 then 



(4.5.1) 



(4.5.2) 



(4.5.3) 



/ a 









\ 


ai 


a 









2a 2 


2ai 


a 


... 




6a 3 


6a 2 


3aj 


a ... 




24 a 4 


24a 3 


12a 2 


4ai ao 




V : 











q>(g^f(D)g) = 



Finally, the map of an affine transformation leads to binomial coefficients: 
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4>(f(x)>-»f(x + l}) 



/l 
110 

12 10 

13 3 1 



J 



The first result is well known ESI . 

Theorem 4.23. If a polynomial f is in p pos then cp(f) is totally positive. (25l 

Proof. We show by induction on the degree of f that cp(f) is totally positive. 
If Mf is multiplication by f then the composition of Mf and M g is Mf g , and 
consequently cp(fg) = cp(f )cp(g). From | |4.5. 11 it is immediate that 4>(a + bx) is 
a totally positive matrix if a and b are non-negative. Since every polynomial 
in P pos can be written as a product of factors of the form a + bx where a, b are 
positive, the conclusion follows from the fact that the product of two totally 
positive matrices is totally positive. □ 

Theorem 4.24. Iff e P pos and T(g) = f(D)g then cp(T) is totally positive. 

Proof. Same as above. Just note that f (D) is a composition of operators of the 
form f n af + (3f. From (|4. 5.31 1 it is clear that cp(f i — ^ af + |3f ) is totally 
positive. □ 

Remark 4.25. These last two results are equivalent since the two matrices <p(f ) 
and cp(g i— > f (D)g) are similar. This isn't surprising since differentiation and 
multiplication are adjoint. If F is the infinite diagonal matrix with ith diagonal 
entry i! then it is easy to see that 



Remark 4.26. It is actually the case that the converse of Theorem 14.231 is also 
true: if cp(f) is totally positive, then f e pP os . This is known as the Aissen- 
Schoenberg- Whitney Theorem [2 J. Also, note that the two by two determi- 
nants give Newton's inequalities. 

The general form of Hurwitz's theorem (see Theorem l7.641 follows imme- 
diately from this remark. 

Corollary 4.27. If Y. a i x i zs zn P v ° s then for any positive d the polynomial 
Y_ cidiX 1 is also in pv° s . 

Proof. The totally positive matrix corresponding to Y. a di xl is formed by tak- 
ing the submatrix of (|4.5.2b whose rows and columns are divisible by d. □ 

We can apply Theorem l4.23l to determinants involving evaluations at inte- 
gers. 



(p(g^f(D)g) ^Fcptf^F 



l 
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Corollary 4.28. Suppose f e pv° s / ^ j s fl positive integer, and a > 0. Then 
if(a+d) i^a+d-l) ... f(a) 



^0 



_L_f( a + 2 d) T23 L_f(a + 2d-l) ... if(a+d) 
Proof. Note that f(x + a) e P pos . From Lemma [7331 we know that 

k=0 

We can now apply Theorem l4.23l 



□ 



If f = 21 octx 1 then we can define an infinite matrix which is just <p(f) (see 
1 14.5.21 ) bordered by powers of x. For instance, if f has degree three then 



N(f) = 



f 1 


OCq 


ax 


«2 


«3 











X 





oc 


a i 




a 3 








X 2 








OCo 


ai 


a 2 


«3 





X 3 











Oo 


ai 


«2 


a 3 


X 4 














Oo 


«1 


oc 2 


V 

















(4.5.4) 



Proposition 4.29. Suppose f(x) = Y. Oi^ 1 " G P pos . ]fd zs odd t/ien aZZ d by d 
submatrices o/N(f) are non-negative for positive x. 

Proof. We proceed by induction on the degree n of f . If n is then 



N(f) = 



/I 



oco 

ceo 

a 









OCq 



We need to check that every d by d submatrix has non-negative determinant. 
If the first column isn't part of the submatrix then this is trivial. If the first 
column is part of the submatrix then it's easy to see that the only way the 
determinant is not zero is if the submatrix looks like (for d = 5): 



/ x m 


Oo 








°\ 


X m+X 





Oo 








x m+2 








OCo 
















ao 


v x m + 4 











0/ 



CHAPTER 4. HOMOGENEOUS POLYNOMIALS 



115 



The determinant is (— l) d_1 x m+4 0Cg _1 . Since d is odd, this is positive for pos- 
itive x. 

If we multiply N (f ) on the left by cp ( a + bx) (see 114.5. 2b ) then the result is 



a + bx 
(a + bx)x 
a + bx)x 2 



aao 





aai + bcxo 




aa2 + bai 
aai + bao 
aao 



V 



(4.5.5) 



This is nearly N((q + bx)f), except that the first column of [ 14. 5.51 1 is the first 
column of N ( ( a + bx)f ) multiplied by a + bx. 

Assume that the proposition holds true for N(f). Any submatrix of 
N ( ( a + bx)f ) not containing the first column has non-negative determinant. If 
a submatrix contains the first column, then its determinant is (a + bx) times 
the corresponding determinant in cp ( a+bx)N (f ), which is positive by Cauchy- 
Binet and the induction hypothesis. 

□ 



Remark 4.30. The case of d 
lary !4.39b for positive x. 



3 yields the cubic Newton inequalities (Corol- 



4.6 More coefficient inequalities 

An important consequence of Lemma 11.551 concerns the coefficients of inter- 
lacing polynomials. 

Corollary 4.31. If f = ao + h a n x n , g = b + h b n x n , f < — g, there is 

no index k such that — b^ = 0, and f, g have positive leading coefficients then for 
i = 0, 1, . . . ,n — 1 we have 

| aj | > (4.6.1) 

Proof. From Lemma [l.55l we have that 



> 



f(0) g(0) 


1 a-o a-i 1 


f'(0) g'(0) 


_ 1 b D bi | 



since zero is not a common root. In the general case we differentiate f g 
exactly i — 1 times. The result is the interlacing 

i!di + [i + lLa i+1 + • ■ ■ + (n)^-^ 1 «i!bi + (i+l],b 1+1 + ■ ■ • + (n)^-^ 1 
We can now apply the case i — to complete the argument. □ 

Remark 4.32. There is a useful picture to help remember this result that will 
reappear in higher dimensions. If we write the coefficients of f and g in the 
diagram 
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bo 



b 2 



ao 



ai 



0-2 



as 



then in each pair of adjacent triangles of a square the product of the coeffi- 
cients at the diagonally adjacent vertices is greater than the product of the 
coefficients at the remaining two vertices. 

Remark 4.33. We can derive these inequalities from Newton's inequalities. In 
fact, if f = Oi( x — Q i)' an d fj = f/( x — Q j), then Newton's inequalities im- 
ply that the determinant 1 14.6. lb is positive for the functions f and fj. Now 
the determinant is linear in the coefficients of g, so the coefficients of any con- 
vex combination of f, f \, . . . also satisfy 04.6. It . These convex combinations 
describe all the possible g's by Lemma [1.201 

If we restrict f, g to lie in P pos then 

Lemma 4.34. I/f = a + • • • + a n x n , g = b + • • • + b n x n , f«g,f,geP pos and 
f , g have positive leading coefficients then for all ^ r < s ^ n: 



b r b s 
a T a. 



>0 



Proof. If f , g G P pos then all coefficients are positive, so from Corollary 14.311 
we have the inequalities 

ar+i b r+ i a r +2 b r+ 2 a s b s 

a r b r a r+1 b r+1 a s _i b s _i 

Multiplying these inequalities yields the conclusion. 

□ 

In Lemma l4.34l we extended the Corollary l4.31l to determinants of the form 
I W+d bi I - Next, we observe that the coefficients of interlacing polynomials 
can not agree very often: 

Corollary 4.35. I/f <g are both in P v ° s / then f and g can not agree at any two 
coefficients. 

Proof. If f , g have coefficients as above, and = b^, a^+d = 'b^+a, then 
Lemma |4.34| is contradicted, for the determinant is 0. □ 

Here's another variation. 

Lemma 4.36. If^ — Y. a O<- x an d g = Y. br* 1 interlace, are linearly independent, 



and 



ao b 
ai bi 



then qq = bo = 0. 
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Proof. Since the determinant is zero there is a A such that A(qo, cli) = (bo, bi). 
If A ^ then by linear independence Af — g ^ 0, and f and Af — g interlace. 
Since Af — g has a factor of x 2 , f has a factor of x. Considering g and Af — g 
shows that g has a factor of x. 

If A = then g has a factor of x 2 , so f has a factor of x. □ 

Remark 4.37. If we form the matrix of coefficients of three mutually interlacing 
quadratics f < g C h, then we know that all two by two determinants are pos- 
itive (Corollary l4.31|l , but the three by three determinant can be either positive 
or negative. In particular, the matrix of coefficients is not totally positive. 



h 



(4 + x)(12 + x) 

9 = (3 + x)(9 + x) 

f = (2 + x)(8 + x) 

48 16 1 
27 12 1 
16 10 1 



h 



(8 
9 = (7 
f = (6- 



x)(12 + x) 
x)(10 + x) 
x)(9 + x) 



4 = 



96 20 
70 17 
54 15 



Next we have a kind of reverse Newton's inequalities for two by two de- 
terminants of interlacing polynomials. 

Lemma 4.38. Suppose that f = Y. a i xl an d 9 — Y. ^i.* 1 are ™ Pi n )- Iff^iQ then 

2 

^4 



b k 



bk+2 
ak+2 



(k + 2)(n-k) 



(k + l)(n-k-l) 



bk 
dk 



bk+i 
ak+i 



bk+i 
Qk+i 



bk+2 
Qk+2 



Proof. Since f <f; g we know that f + ccg e P for all a. If we let (3 = (k + 2) (u — 
k)/( (k + 1) (n — k — 1) ) then Newton's inequalities state that for all a we have 

(ak+i + ab k+ i) 2 ^ (3 (a k + ab k )(a k+2 + ab k+2 ) 

Since the difference is non-negative for all cc the discriminant is non-positive. 
A computation of the discriminant shows that 



P 2 



b k b k+2 
Qk Qk+2 

and the conclusion follows. 



4f3 



b k 
Qk 



bk+i 
Qk+l 



bk+i 
Qk+i 



bk+2 
a k +2 



s$ 



□ 



An equivalent formulation is that the determinant below has all real roots. 



1 VPx x 2 
bk b k+ i b k+ 2 
n k Q k +1 Q k +2 

Rosset [ 147J called the following corollary the cubic Newton inequality 
Corollary 4.39. Ifi (x) = Y. ^ € P pos (n) and k s$ n - 3 then 

Q k +2 2 — Qk+1 Qk+3) ^ (cLk+1 <lk+2 — CLk <l k+ 3) 2 



4 (ak+i - a k a k+2 
Proof. Apply the lemma to xf and f . 



□ 
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4.7 Log concavity in P pos 

The logarithms of the coefficients of a polynomial in P pos have useful and 
well-known concavity properties. We will revisit these ideas when we con- 
sider polynomials in two variables. 

To begin, suppose that f = do H h d n x n has all positive coefficients and 

is in P pos . We define the Newton graph of f to be the collection of segments 
joining the points 

(0, log do), (l,logai),- •■ , (n,loga n ). 
For instance, Figure l4~7l is the Newton graph for n = 4. 



(0, log ao) 




(4, log 04) 



Figure 4.1: The Newton graph of a polynomial in P pos 



The Newton graph is concave. Indeed, the slope of the segment between 
(k, log dk) and (k + l,logd k+ i) is log(di c+ i/d] C ). Taking logs of Newton's in- 
equality yields 

, a k+l , <lk+2 

log — — > log , 

dk d k+ i 

which means that the slopes are decreasing. This implies that the Newton 
graph is concave. 

Next, we consider the Newton graphs of two interlacing polynomials. As- 
sume that f <C g in P pos . In order to compare coefficients we will assume that 
both are monic, so that their Newton graphs agree at the right hand end- 
point. The Newton graph of f lies below the Newton graph of g, and the 
gap decreases as the index increases. For example, Figure 14.71 shows f and 
g = Y_ btx 1 . The dotted lines decrease in length as we move to the right. 

We use Lemma |4.34| which says that | ^ | > 0. Taking logs shows that 

, b k+ i d k+1 b k+ i b k 
l°g^— <l°g-— < lo § — 

D k d k d k+ i d k 

and these inequalities imply the assertions of the last paragraph. 



4.8 A pair of recursions 

In [188] there is an interesting pair of recursions 
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Figure 4.2: The Newton graphs of two interlacing polynomials 



Pn+l(z) = ZPn(z) - tX n Z n q n (l/z) 

qn+i(z) =zq n (z) - (3 n z n p n (l/z). 

If we homogenize these equations then we can find recurrences for p n and q n . 
Using these these recurrences we can easily show that p n and q n are in P for 
various choices of <x n and |3 n - So consider P(x,y) and Q(x,y) satisfying the 
recurrences 



Pn+i(x,y) = xP n (x,y) - a n y Q n (y,x) 
Qn+i(x,y) =xQ u (x,y) - |3 n y P n (y,x) 

with the initial conditions P (x,y) = Qo(x,y) — 1. We compute Q n+ i(y,x) 
using each equation 

1 1 

(y Pn+i (y , x) - P n +2 (y, xj) = (y P n (y , x) - P n +i (y, xj) - 



x% + i " xa n 

(3 n y Pn (y,x) 

Interchange x and y and simplify 

Pn+2(x,y) = ( x + — - —y ) P n+ i(x,y) + f a n +i(3 n - — : — J xy P n (x,y) 
\ oc n J \ On ) 

Since P n is homogeneous P n (x, 1) = p n (x) and 

Pn+2(x) = ( X + " '- — ) p n +l(x) + ( a n+ i(3 n - ) Xp n (x) 

\ 0C n J \ On J 

If the coefficient of p n is negative then this defines a sequence of orthogonal 
Laurent polynomials (p. 145) . In the special case that a n — |3 n = a for all n 
then p n = q n = f n and we get the simple recurrence 

f n+2 = (x+l)f n+1 + x(a 2 -l)f n (4.8.1) 
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Lemma 4.40. If\a\ < 1, f = l,fi = % - otand f sah's/ies (|483]l t/zerc f n e P alt 
and f n+ i <f n . 

Another interesting case is when oc n = |3 n = — q( n+1 )/ 2 . The resulting 
polynomials are known as the Rogers-Szego polynomials. The recurrence is 

fn+2 = (x + l)f n+1 + x(q n+1 - l)f n (4.8.2) 

It follows that 

Lemma 4.41. I/q > 1, f = \,i x = x + q and f sah's/ies g^2t f/zen f n e P Qlt and 

fn+l < f n . 



Chapter 




Analytic Functions 



We introduce some classes of analytic functions that are the uniform closure 
of certain sets of polynomials with all real roots. We are interested in the re- 
lationship between these classes, and the set of generating functions of linear 
transformations of the form x x i— > at* 1 that preserve roots. 

In later chapters we will generalize these results to more general linear 
transformations, and to polynomials in several variables. 

5.1 The uniform closure of P pos 

We begin with the simplest class of analytic functions, the closure of polyno- 
mials with all positive coefficients and all real roots. 

Definition 5.1. P pos is the set of all functions that are in the closure of P pos 
under uniform convergence on compact domains. 

From basic complex analysis we know 

1. All functions in P pos are analytic. 

2. All functions in P pos are entire - this means that they are defined on the 
entire complex plane. 

3. All derivatives of functions in P pos are in P pos . 

4. The product of two functions in P pos is in P pos . 

5. Every function in P pos has all real roots. 

It is obvious that P pos c P^ 5 . Thelimit of P Qlt is P alt , and satisfies the 
above properties, except that P alt C P alt . We now introduce a few important 
non-polynomial functions in P pos . 
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Example 5.2. The exponential function. e x is the most important non- 
polynomial function in P pos . To see that it is in P pos note that 

lim (l + -T = e x 

so e x is the limit of polynomials that have n-fold zeros at — n. These zeros 
go off to infinity so e x has no zeros. More generally, e ax is in P pos for any 

positive a. If a is negative then e ax e P alt . 
Also, since 

km 1 + x + — + ••• + — = e x 
n^oo \ 2! n! / 

we see that e x is also a limit of polynomials that are not in P, so e x is on the 

boundary of P^. 

Example 5.3. The q-exponential function. There is a q-analog of the exponential 
function. We first need to introduce the Pochhammer symbols 

(a;q) n = (l-a)(l-aq)-..(l-aq Tl - 1 ) 
(a; qU = (1 - a)(l - aq) • • • (1 - aq"- 1 ) ■ ■ • 



The following basic identity [67, (1.3.16)] is valid for |q| < 1: 

^■ (q;q)n 

We denote this function by Exp q (z). If we write out the right hand side of 
(5X1) 

{-%; q) M = (1 + z)(l + qz)(l + q 2 z) • • • 

then we see that for < q < 1 all the roots of Exp q (z) are negative, and 

so Expq(z) is in p pos . We will later use this variation (see |6|) on the q- 
exponential function, where [n] = (q n — l)(/(q — 1): 

oo f"-j 

E q (z) = V ^A- Z - = ((l-q) z; q) 00 |q| < 1 (5.1.2) 

Remark 5.4. p pos is closed under products, so we can construct new elements 
in p pos by multiplying appropriate factors. First of all, x T and e ax are always 
in p pos for r a non-negative integer, and a positive. If the infinite product 
f7J°(l + Cix) converges, where all d are positive, then the following is in P pos : 

oo 

cx r e QX ]^[(l + Cix) (5.1.3) 

i=l 

A famous result due to Polya-Schur [14, 138] states that this representation 
describes all the elements of P pos . 
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5.2 The uniform closure of P 

P is the uniform closure of P on compact subsets. It satisfies the same gen- 
eral analytic properties that P pos does. Here are some examples of analytic 
functions in P . 

Example 5.5. The Gaussian density. Since 



is in P \P P ° s since it is the limit of polynomials with both positive and negative 
roots. 

Example 5.6. Sine and Cosine. The sine and cosine are in P since they both 
have infinite products with positive and negative roots: 

sm(x)=xmi-^) 

k=l v ' 

3 5 
X X J X 3 

= I~ 3!~ + 5f 

CO 

cos(x) = Yl 

k=0 

X' X' 

- 1 -2! + 4!--" 



e- 2 = lim (l-t) = lim (l * ) ( 
we see that 

2 4 6 
^2 „ X X X° 

6 - 1 -l! + 2!-3! + - 



1 *«—) 



Example 5.7. Bessel Functions. The Bessel function of the first kind, Ji(z), for i 
a positive integer, is defined as 



_2k 



J i (z) = - T ^(-1 



2 iZ_^ J 2 2k k!(k- 



k=0 



There is an infinite product representation [72, page 980], where we denote 
the zeros of Ji(z) by Zi, m : 



Z\ l 1 i-r / Z' 



"w-(i)in 



2 



Z 2 

m= l \ i,m, 



The product representation shows that ]i[z) is in P. 

Inequalities for elements in P follow by taking limits of inequalities for P. 
For instance, Newton's inequalities l|4.3.2|l become a 2 , ^ a^-icik+i. Using 
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this observation, we derive Turan's inequality: If P n (x) is the n-th Legendre 
polynomial, then 

PnM 2 - Pn + lMPn-lM > for |x| < 1 (5.2.1) 

To see this, we need the identity 

00 ,.rt 

Y_? n {x)\ = e^J (yVl-x 2 ) 

n=0 n ' 

Thus, if |a| < 1 the series Y. Pnia)^ is in P. An application of Newton's 
inequalities @33) yields ^2J) . See IU691 . 

Example 5.8. The Gamma function Although the Gamma function is meromor- 
phic, its reciprocal is an entire function with the product [138, page 242]: 

1 + r) e~T ( i + !l) e -T ...(\ + 1) e -n . . . 



r(z+l) 



where y is Euler's constant. The roots are at the negative integers —1, —2, . . . 
Moreover, if we choose a positive integer k then the quotient 

G z 



r(kz + i) 

is an entire function since the poles of the numerator are contained among 
the poles of the denominator. In addition, G(z) is in P since it has a product 
formula. 

Remark 5.9. There is an explicit description II14U138I of the elements of P that 
is slightly more complicated than the description of the elements of P pos . The 
elements of P have the product representation 

oo 

cx r e ax-bx 2 j"[(l + CiX ) e -Ci* (5 2 2) 

i=l 

where a, C\, . . . are real, r is a non-negative integer, and b is non-negative. 
Notice that e" 2 is not in P . 

We can use trigonometric identities to find some non-obvious members of 
P. Recall that 

1 x + y x — y 

- sin — - — cos — - — = smx + smu 

2 2 2 

If we choose a so that |q| ^ 1, then we can find y so that siny = a, and hence 

1 x + a x — a 
sm(x) + a = - sm — - — cos — - — 

2 2 2 

Since the right hand side is a product of members of P, it follows that a + 
sin x e P. Multiplying several of these together yields half of 
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Lemma 5.10. Suppose that f (x) is a polynomial. Then, f (x) £ P l ' ' if and only if 
f(sinx) e P- 

Proof. Conversely, let r\, ...,r n be the roots of f(x). Then the roots of f(sinx) 
are solutions to sinx = x- x . If any rt has absolute value greater than 1 or is not 
real, then arcsinr^ is also not real, so f(sinx) has non-real roots. □ 

5.3 The Hadamard Factorization Theorem 

How can we show that a function f is in P or P pos ? It is not enough to know 
that all the roots are real - for example e* 2 has no complex roots and yet is not 
in P . If all roots of f are real we can determine if f is in P using the coefficients 
of the Taylor series. 

The Hadamard factorization theorem represents an arbitrary entire func- 
tion as an infinite product times an exponential factor. We first summarize 
the necessary background (see [149J or |15|), and then use the machinery to 
exhibit a collection of functions whose exponential part satisfies the necessary 
conditions to belong to P . 

We begin with definitions. If p is a non-negative integer, we define the 
Weierstrass primary factor of order p by 

E(z,0) = (l-z) 

z 2 z p 
E(z,p) = (1 -z) exp(z +- + ■•• + —) 

2 P 

For a set Z of complex numbers, the genus ofZ, p = p(Z) is defined by 

p(Z) = inf \ q : q is an integer, Y ] < oo \ 
I z^z |zi|q J 

The canonical product over Z is defined by 

Pz.^nO) 

where p = p(Z). The maximum modulus of f (z) on the circle |z| = r is denoted 
M(r). The order p = p(f ) of f(z) is 

loglogM(r) 

p = lim sup — 

r^oo logr 

We can now state 

Theorem 5.11 (Hadamard Factorization Theorem). If f (z) is an entire function 
of order p with an m-fold zero at the origin, we have 

f(z] =z m e Q(z) P z (z) 
where Q (z) is a polynomial of degree at most p, and Z is the set of zeros ofi. 
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The particular case we are interested in is p < 1 and p = 0. In this case 

f (z) = z m e az Yl ( a - ~) ( 5 - 31 ) 

If all the roots zi are negative then f 6 pP os / anc j f G P if there are positive 
and negative roots. The main difficulty is showing that all the roots are real. 

How can we determine the two parameters p and p? First of all, it is known 
that p < p. Second, it is possible to find p from the Taylor series of f . Suppose 
that 

CXI 

f (z) = y_ ^ 

i=0 

If the limit 

limsup-^- (5.3.2) 

n^o/ log(l/|a n |) 

is finite then its value is p. 

We are in a position to determine the exponential part. We begin with the 
Bessel function J (— isfx). 

Example 5.12. The Bessel function. 

OO i 

Jo(-VZ) = )"^eP^ 
* — i! v. 

i=0 

We first recall Stirling's formulas: 



n! - V2jmn n e- n 



1 

logn! - — log(27tn) + nlogn — n 



The limit j53!2l is 



, nlogn nlogn 1 

lim sup - — —2 — — = hm sup — = = - 

log(n!n!) 2(± log (27m) + nlogn - n) 2 

Thus p = \, and hence p = 0. This shows that we have the desired represen- 
tation (|5.3.1|l , and since all terms are positive there are no positive zeros. Since 

the Bessel function is known to have all real zeros the series is in P pos . 
The general Bessel function has a series expansion 

J-v(z) has infinitely many real roots for all v, and all of its roots are real for 
— 1. Since a similar computation shows that p = for J^, (z), it follows that 

J-v G P for-O -1. 
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We now give some examples of computations of p; since we don't know if 
the functions have all real roots, we can't conclude that they are in P . 



Example 5.13. Suppose that a\, . . ., a T and bj, . . . , b s are positive. Then 

^ (ain)!---(a T n)! 

Z_ 7Z — Vt TZ — m z has p < 1 

bin ! • • • b s n ! 



T1=0 



if (Libk)-(L>k) >1. 

Compute the limit using Stirling's formula: 

nlogn nlog 
llmsu P T^TrTTi; ZXi i -i-Tf^ = limsup 



logn(bkn)!-logn(ikn)! r 21 b k nlogb k n - £ Qknlog a k n 

1 



L b k - L Q k 

If the hypothesis are satisfied, then the last expression is less than 1, and p < 1. 

Example 5.14. Here is a similar example. Suppose that a\,...,a r and 
bi, . . . , b s are positive. We define the hypergeometric series 

(qj) • • • (a T ) z n 
r F s ai,...,a T ;bi,...,b s ;z = T i^v 1 ^ 

If we compute the limit as above, we find that it is 1+ g_ T ■ If s — r > then 
p < 1. Hurwitz showed that the series 

has all real roots if bi —1 and all the other bi are positive. 
Example 5.15. A q -series. Consider 

1=0 

The series converges if |q| > 1, or q = — 1. The limit (|5.3.2b in either case is 

0: 

nlogn nlogn 

lim sup r— — = lim sup "T~^ll = 

n^oo log(n!q' 2 J) rt^oo nlogn+-^ — logq 

and so p = p = and again the representation 115.3. l|l holds. If q > 1 all terms 
are positive, and if q = — 1 there are positive and negative signs. 

Example 5.16. A q-factorial. Consider the two series for q > 1. 

y and y J- 
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The limit for the first series is 



nlogn nlogn 

limsup — «-n : — - — = hmsup f — = 0. 

K nlogn + Y.i=o !og(l - q l ) n.-*/ nlogn + ^=il log q 



n^oo 



The terms alternate in sign the first series. If q < — 1 then all terms of the 
second series are positive. 



Example 5.17. Suppose r > 0. If we compute J5.3.2I I for 

'r + n-T 

n=0 



we find that p is infinite, which isn't surprising since this is not an entire func- 
tion. However, 

n=0 v 7 

has p = 1. A result of Polya-Schur (Lemma 115.251 1 says that if f e P, and all 
coefficients are non-negative then f 6 P pos . If we knew that EXP(1 — x)~ T e P 
then we could conclude that EXP(1 - x)~ r e pP° s . 

Remark5.18. Considerf = Y" (a)^. Since EXPf = (l+x) a weseethatEXPf ^ 

P. However, when a = 1/2 we have the identity that EXP 2 f = e x/2 I (x/2) 
where Io is the modified Bessel function of the first kind. Since 

oo j 

* — i!x! 

i=0 

we have that EXP 2 f G P. 



5.4 Polynomials with rapidly decreasing coefficients 

If the coefficients of a polynomial are decreasing sufficiently rapidly then the 
polynomial has all real roots. To motivate this, suppose that f e P(n), and 
define g(x) = f + ocx n+1 . If a is very small then the graph of g looks like 
Figure 15.11 There are n roots of g that are close to the roots of f , so g has all 
real roots. 

The following theorem of Kurtz [110| gives a sufficient condition for the 
existence of all real roots. 

Theorem 5.19. Let f = Y. a i- x ^ ^ e a polynomial of degree n > 1 with all positive 
coefficients. If 

a? -4at_ia i+1 > i = 1,2, . . .,n - 1 (5.4.1) 
then all roots off are real and distinct. 
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If we have equality in 05.4.11 1 and Qq = 1 then the polynomial is 

n. 



k=0 

and the coefficients are rapidly decreasing. 

Corollary 5.20. Ifoc^i then the series below is in P and all of its partial sums are 
in P. 

oo 
k=0 

Corollary 5.21. Ifa.^2 then then the series below is in P and all of its partial sums 
are in P. 

oo 

y x k aT k2 

k=0 

Proof. Use the fact that a ^ 2 to check that 

„ 2 \2 



(a- k2 ) > 4a- (k - 1 »V< k+1 ' 2 

□ 

Corollary 5.22. If q > 4 then then the series below is in P and all of its partial sums 
are in P. 

oo v 

LX K 
Ml 



[k]! 

k=0 



Proof. The Newton quotient is 



[n + 2]! [n]! _ [n + 2] _ q n+2 - 1 

([n + 1]!) 2 "[^+l]"q^-l >0 ' 



□ 
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Corollary ESS is in 11381 V 176]. In IT391 III 200] they show that there is 

exactly one root in each annulus ct 2n ~ 2 < \z\ < a 2n for n = 1,2, There is a 

similar result due to Kuperberg [109[. 



Lemma 5.23. Ifq^A then the series below is in P, and all partial sums are in P. 

LX K t— X K 

i =Q J^q) = 2- (l-q)(l-q2)--.(l-qk) 

Proof. Kuperberg proves this without reference to Kurtz's result. He shows 
that if F q n is the sum of the first n terms of the series and q 4 then 
Fq,n( c l k+ ^ ) alternates in sign as k ranges from to n. □ 

If all the partial sums of an infinite series have all real roots, then the coef- 
ficients satisfy a quadratic exponential bound, but the constant is y/2, not 4 as 
we would like. 

Lemma 5.24. IfY. Q i xl ^ as a ^ positive terms, and all partial sums have all real roots 
then 

Proof. Since Qo + aix + • • • + a n x n has all real roots we can apply Newton's 
inequalities 114.3.21 



a n a n - 2 



Qn-i >2 n a n -2 > 2 n 2 n - 1 Qn-3 > 
a n ^ n — 1 a n _i ^ n — 1 n — 2 a n _2 ^ 



Ql 

2 



a / n \ a 

from which l !5.4.2t follows. 



□ 

We end the section with a result where the terms are increasing. 
Lemma 5.25. Suppose ^ ao ^ ai < a2 • • • < • • • . Then the polynomials 

n n 

f n = Y_ a i (-!) v felt gn = ^a i (x) i 

i=0 i=0 

have all real roots. In addition, f n+1 <f n and g n +i < 9n- 
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Proof. The second case follows from the first upon substituting — x for x in 
the first case. Note that if k is an integer satisfying 1 $C k ^ n then f n (k) = 
^o _1 cut— l) 1 (k) i . As a function of i the sequence {at} is non-decreasing, and 
{(kJi.} is increasing, so the sign of f n (k) is the sign of the largest term, and 
hence is (— 1) Consequently, f n sign interlaces (x) n , and so in in P. Since 
fn+i =fn + a n+ i(-l) n+1 (xj n+1 it follows that f n+ i<f n . □ 

5.5 Interlacing of analytic functions 

We use Proposition 11.351 to motivate our definition of interlacing for analytic 
functions in P. 

Definition 5.26. If f, g S P then we say that f and g interlace if f + ag is in P 
for all real a. 

Later in this section we will relate interlacing of analytic functions to the in- 
terlacing properties of their zeros. Although it might be hard to verify directly 
that a linear combination is in P we can establish interlacing using limits. 

Lemma 5.27. Suppose that f, g G P and we have sequences of polynomials 
(fn)/ (g-n) where the degree of f n and g n is n such that f n — ► f and g n — > g. 
If f n and g n interlace for every n then f and g interlace. 

Proof. Since f n and g n interlace we know that f n + ag n is in P for all a. Taking 
limits, we see that f + ag is in P and hence f, g interlace. □ 

Just as for polynomials, the derivative is interlacing. 

Lemma 5.28. Iff € P then f e P , and f and f interlace. 

Proof. If f n is a sequence of polynomials that converge to f, then the deriva- 
tives f ' n converge to f ' . Since f n + <xf ' n converges to f + ocf ' it follows that f and 
f interlace. □ 

Remark 5.29. Exponential functions e ax and e$ x only interlace if <x — (3. In- 
deed, if they did interlace, then e^ + e 13 * 6 P. However, 

e ax + e p x =e ocx(l + e (P-oOxj 

has complex roots if a ^ (3. The next three results generalize this simple 
observation. 

We first see that no linear combination of polynomials and exponentials is 
in P2. 

Lemma 5.30. If f (%], g(x) are polynomials and g(x)e x + f (x) £ P f/ze« f = and 
geP. 
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Proof. We first show that if c e Mthene x + c £ P. If it were then (e D +c)(x n ) = 
(x + l) n + cx n e P, but the roots are complex for n ^ 3. 

Next, if e x + f (x) g P where f has degree d then the d'th derivative is in P 
and is of the form e x + c. This contradicts the first paragraph. 

If g(x)e x + c <E P then replacing x by — x and multiplying by e x shows that 
g(— x) + c e x e P, which we know is not possible. 

Finally if g(x)e x +f (x) e P then the d'th derivative is of the form G(x)e x +c, 
so f must be zero. Multiplying by e~ x shows that g e P. □ 

Next, it is easy to describe the polynomials interlacing e x . 

Lemma 5.31. J/f~e x then f = (ax + |3)e x . 

Proof. The hypothesis means that f (x) + ae x e P for all real a. Thus 
(f + ae x )(D)x n e P for all positive integers n. It follows that 

f(D)x n ~e D x n = (x + l) n 

We can determine f since we know all polynomials interlacing (x + l) n . Since 
f (D)x n has degree at most n it follows that there are a n , b n such that 

f (D)x n = a n (x + l) n + b n (x + 

If we write f (x) = Y. diX 1 then equating terms yields 

d 1 D i x n = &iX n - \ U! M , = x" 
[n-x)\ 

and so 

A 1 n b T 

di = — + 













!)] 









i! n (i-1)! 

Setting i = shows that a n — do, and i = 1 yields b n /n = di — do- Thus 

1 1 
di = d T7 + (di-do) — 



'i! u '(i-l)! 
and consequently 

f(x) = d e x + (di-d )xe x . 
More generally we have 



□ 



Lemma 5.32. If f (x) ~ g(x)e x where g(x) is a polynomial then there is a polynomial 
h~ g such that f (x) = h(x)e x . 



CHAPTER 5. ANALYTIC FUNCTIONS 



133 



Proof. We first show that it suffices to show that f(x) = H(x)e x for some poly- 
nomial. If so, then since f + age x e P it follows that (H + ag)e x e P, and 
therefore g + aH £ P. Thus, g and H interlace. 

Assume that g has degree r, and write f = Y. dix\ If n > r + 1 then 
g(D)e D x n = g(D)(x + l) n hasafactorof (x + 1) 2 . Since f(D)x n ~ g(D)(x + l) n 
it follows that 
n > r + 1: 



1 is a root of f(D)x n . This gives us the equations, valid for 
-1) =n!^di- 



-11 







that we view as an infinite matrix equation. For example, if T = 1 we have 

1 



/ -1/6 1/2 

1/24 -1/6 

-1/120 1/24 

V : 



-1 
1/2 -1 
-1/6 1/2 





1 

-1 1 



di 
d 2 



J 



\'-J 








(5.5.1) 



In this example it is clear that we can solve for & 3 , di, . . . in terms of do, di, d 2 . 
If M is the lower triangular matrix whose ij entry is (— l) l ~'/U — j)! then the 
matrix above if formed by removing the first three rows of M. Now the in- 
verse of M has entries l/(i — j)!, and if we multiply 115.5.11 by M _1 we recove 
the identity matrix after r + 1 columns. In the example, the result of multiply- 
ing by M _1 is 



/ 


-1/6 


1/2 


1 


1 











...\ 


/d \ 








-1/8 


1/3 


-1/2 





1 










di 









-1/20 


1/8 


-1/6 








1 







d 2 









-1/72 


1/30 


-1/24 











1 




d 3 







V 


















V-J 




W 



Consequently we find 

d 3 = 1 
d 4 = \ 



dn = 



d 2 - 
d 2 - 

d 2 - 



n-2 



(n-2)! ^ (n-1)! 

Adding up the series determines f : 



di 
di 

di 



(Tt-l)(n-2)/2 



| do 
I do 

do 



do + dix + d 2 x 2 + d 2 y 



n=3 



n-2)! 



oo - oo /n— 1\ 

n=3 V '' n=3 



n! 



= e x ( d 2 x 2 + di(x-x 2 ) + d (l-x+-x 2 ) 



which shows that f is e x times a quadratic. 
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In the general case we follow the same approach, and find that f equals 

r+l /T+l-i 



k! 



i=0 v k=0 

which is a polynomial in x of degree r+l times an exponential, as desired. 

□ 

We now define interlacing of zeros: 

Definition 5.33. If f and g are in P then we say that the zeros off and g interlace 
if there is a zero of f between every pair of zeros of g, and a zero of g between 
every two zeros of f . If the set of zeros of f and g have least zeros Zf and z g 
then we say that f < g if f and g interlace, and Zf ^ z g . 

If the zeros of functions in P pos interlace and the exponential factors are 
the same, then the functions interlace. We would like to prove the converse. 

Lemma 5.34. Suppose that 

00 00 
f = z T e vx n(l + c i x) g =x T e Y *n(l + d i x) 

i=l i=l 

are in P v ° s where c\ > C2 > • • • > and d\ > d2 > ■ ■ ■ > 0. Then f and g interlace 
if 

ci > di > C2 > d2 . . . or di > ci > d2 > C2 . . . 
Proof. Since the Ci's and the dt's interlace then the partial products 

Tl Tl 

fn=n< i+c ^ gn=nt i+d ^ 

t=l i=l 

interlace, and converge to interlacing functions. If we then multiply these 
interlacing functions by x T e yx we find that f and g interlace. □ 

We can generalize part of Lemma [l.20l 

Lemma 5.35. Suppose that f(x) = fT(l— x/cu) isinP v ° s . Choose positive on such 
that Y_ «4 < 00. Then 



2. Q = > oa- — g P pos 

interlace. 



1. g = V_ at- — 

* — 1 — x/ai 

2. f and 5~ on- — —, — 

L — 1 — X/Oa 

Proof. Consider 

n n ^ 

fn = TT(l-x/aO and g n = Y" on- — 

± X/ CLi 

A 1 1 ' 



i=l i=l 



We know that f n <g n - The assumption on ai implies that g n converges uni- 
formly to g. The result now follows by taking limits. □ 
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5.6 Characterizing P 

The following theorem shows that the functions in P are exactly the right func- 
tions to generalize polynomials. 

Theorem 5.36. An analytic function f satisfies f ( D)P c P if and only if I is in P. 
It satisfies f (D)P pos c P v ° s if and only if it is in P?™. 

Proof. If f is in P, then it is the uniform limit of polynomials. The contain- 
ment is true for each of the polynomials by Corollary l7.15l so it is true for f. 
The converse is more interesting, and follows an argument due to Polya and 
Schur[137|. First write f (x) = Y. Q i xl / an d set q n = f(D)x n . Expanding the 
series 

n 

i=0 

and reversing q n gives a polynomial in P 

n 

t=0 

If we now replace x by x/n we find 

p 1 ,(x/u)=^a i x n - i n(l--) 

i=0 k=l 

The polynomials p n (x/ n) converge uniformly to f , and so f is in P . The second 
case is similar. □ 



5.7 P-positive semi-definite matrices 

We derive some simple inequalities for quadratic forms involving the coeffi- 
cients of a polynomial with all real roots. It is interesting that these inequalities 
are valid for a broader class of functions than just P . 

Define an n by n matrix Q to be P-positive semi-definite if for every f = 
YJ) CM.x l € P and A = (ao, ■ • •, Q n -i), we have AQA* ^ 0. Of course, any 
positive semi-definite matrix is also P-positive semi-definite, but the converse 

is not true. In particular, P-positive semi-definite matrices can have negative 

/ o o -2 \ 

entries. Indeed, Newton's inequalities imply that the matrix f l o J is P- 

positive semi-definite. 

We first note that it is easy to construct new P-positive semi-definite ma- 
trices from old ones. It's obvious that P-positive semi-definite matrices form 
a cone. In addition, 
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Lemma 5.37. IfQ is a P-positive semi-definite matrix, and B is the diagonal matrix 
of coefficients of a polynomial in p v ° s , then BQB is also P-positive semi-definite. 

Proof. If A is the vector of coefficients of f e P, then AB is the vector of coef- 
ficients of the Hadamard product of f and g. By Theorem 19 . 841 the Hadamard 
product is in P, it follows that A(BQB) A 1 = (AB)Q(AB) t ^ 0. □ 

Here is a construction of a P-positive semi-definite matrix. Note that it 
applies to polynomials in P. 

Proposition 5.38. Suppose that 

OO / 

f(y) = Y_ aiU 1 = e- 9(y)+H(y) Yl (l - - 



i=0 



where h(y) is a polynomial with only terms of odd degree, g(y) has all non-negative 
coefficients, and all exponents ofg are = 2 (mod 4). Ifm is a positive integer then 

m 

Y_ (-l) m+j Q m -ja m+j ^0 
j=— m 

Since all real rooted polynomials satisfy the hypothesis, the 2ra + lby 2m + 1 matrix 
whose anti-diagonal is alternating ±1 beginning and ending with 1 is P-positive 
semi-definite. 

Proof. The coefficient of y 2m in 

f(u)f(-v) = (X a ^ s ) (L Q >-(-^ T ^ r 

is 

m 

^ (-lj'anx-jam+j 

j=-m 

On the other hand, since h(y) + h(— y) =0 and g(y) = g(— y) by hypothesis, 
we have 



f(y)f(-y) = ^e-9<^+ H ^' EI ( a - 7)) (e-^ +h ^ IT ( a + 7 

n (1 - (5.7.1) 



Since r? is positive, the sign of the coefficient of y 2m in the infinite product 
is (— l) m . The coefficient of y 2m in e~ 2g(yl is either (— l) m or 0. Consequently, 
the sign of the coefficient of y 2m in d5.7.1| | is also (— l) m or 0, which finishes 
the proof. □ 

The case m = 1 in the following corollary is the Laguerre inequality. 
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Corollary 5.39. Suppose that f e P, and m z's a positive integer. Then 
2- 1 J (m-j)! (m + j)! " 



j=— m 



More generally, ifQ = (qy ) z's a P -positive semi-definite matrix, then 

_ f(i) f(i) 

5>nrir* 



Proof. Use the Taylor series 



f(x + -y) = ff ,s| (x)^ 



□ 



Corollary 5.40. Zf P n (x) z's ff*ie Legendre polynomial, m is a positive integer, and 
|x| < 1 f/zen 

Li nm+j Pm-j( x ) Pm+jl^) ^ n 
1 1J (m-j)! (m + j)! ^° 

i=— m 

Proof. The following identity is valid for |x| < 1. 



T1=0 



n! 



Now apply Proposition l5.38l 



□ 



Remark 5.41. Krasikov|107J essentially shows that the matrix below (derived 
from his Viff )) is P-positive semi-definite. 



(0 











- 2 \ 











-1 











2 











1 











V-2 











0/ 



To see this directly notice we can write it as a sum of P-positive semi-definite 
matrices. 


















f 











- 3 \ 











-1 




























1 








+ 








1 











-1 




























V 











V 




V-3 











0/ 



The first one is P-positive semi-definite by the proposition. From equation 
| |4.3. 51 we have a| 6aoa4 which shows the second matrix is P-positive semi- 
definite. More generally if < k < n then Newton's inequalities imply that 
the matrix with 1 in the (k, k) entry and -| ^ in the (k + 1, k - 1 ) and 

(k — 1, k + 1) entries is a P-positive semi-definite matrix. 



Chapter 




Linear Transformations of 
Polynomials 



The aim of this chapter is to establish properties of linear transformations that 
preserve roots. In Section l6Al we show that if we put assumptions on T such 
as Tf < f for all f e P then Tf is the derivative. If T is bijection on P, then T 
is affine. We see how to determine the possible domain and range of a lin- 
ear transformation. In the next section we study transformations of the form 
f i ► f(T)(l) where T is a linear transformation. The following two sections 
consider properties of linear transformations that are defined by recurrence 
relations. We then consider the effect of Mobius transformations on the the 
roots of polynomials and on transformations. In the final section we begin the 
study of transformations of the form x n i— > 0™(x + at). 



6.1 Characterizing transformations 

Many linear transformations can be characterized by their mapping and in- 
terlacing properties. The derivative can be characterized by its fundamental 
interlacing property f <,f. This result relies on the fact that the only monic 
polynomial f such that (x + a) n <jf is f = (x + a) n_1 . 

Theorem 6.1. If T is a linear transformation on polynomials such that f <T(f) for 
all polynomials f in P alt (or for all polynomials in P v ° s ), then T is a multiple of the 
derivative. 

Proof. Choosing f = x m , we find x m <T(x m ), and hence there is a c m so that 
T(x m ) = c m x m_1 . From (x + l) m < T(x + l) m we see that there an a m such 
that 

r \ m— 1 / t\ 



Y_[ ! J c i x i - 1 =T(x + l) m = a m (x + l)— : = a m £ 



i=0 
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Equating coefficients shows that Ct = (a m /m) i, and so T(f) is a multiple of 
f. ' □ 

Theorem 6.2. If T is a linear transformation with the property that Tf and f have 
the same degree, and interlace for all f in P alt (or for all f in P pos ) then there are 
constants a, b, c such that 

Tf = af + bxf + cf ' 
If Tf and f interlace for all f € P then Tf = af + cf. 

Proof. By continuity x n and T(x n ) interlace, and so there are constants Q n ,b n 
such that T(x n ) = a n x n + b n x n_1 . Apply T to the polynomial f = x T (x — a) 2 . 
We compute 

Tf = T(x T+2 - 2ctx r+1 + aV) (6.1.1) 
= a r+2 x r+2 + b r+2 x r+1 - 2cta T+1 x r+1 - 
2ab r+ ix r + oc 2 a r x T + a 2 b T x T_1 

Now f has a double root at a, and Tf interlaces f, so Tf must have a as a 
root. Substituting x = a in d6-l.lt gives the recurrence relation 

= a T+2 (a r+2 - 2a r+ i + a r ) + a r+1 (b T+2 - 2b r+ i + b r ) 

This equation holds for infinitely many a, so we get the two recurrences 

= a r+2 - 2a T+1 + a r 
= b r+2 -2b T+1 + b r 

Since these recurrences hold for 1 ^ r ^ n — 2, we solve these equations and 
find that constants a, b, c, d so that a r = a + br, b T = d + cr. Substituting these 
values gives 

T(x r ) = (a + br)x r + (d + crjx 1 "- 1 

Since rx r = xDx r , it remains to show that d = 0. We know that for any 
negative a the polynomials (x + a) 3 and T(x+ a) 3 interlace, and hence (x+ a) 2 
divides T(x + a) 3 . A computation shows that the remainder of T(x + a) 3 upon 
division by (x + a) 2 is da(x + 2a). As this remainder is we conclude that 
d = 0. The case where f G P pos is similar and omitted. 

Finally, assume that Tf and f interlace for all f S P. If b is not zero then we 
may divide Tf by b, and thus assume that Tf = af + xf + cf. Choose a > |c| 
and consider the polynomial 

f = (x-a)(x + a)(x-a-l)(x + a+l) 
A computation shows that 



(Tf)(-a) =2(c-a)a(l + 2a) 
(Tf)(a) =-2(c + a)a(l + 2a) 
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The signs of the left hand side must alternate since f and Tf sign interlace, and 
so (c — ct)(c + a) > which implies |c| > a. Since we chose a > |c| this is a 
contradiction, and hence b = 0. □ 

The remaining case is Tf < f . Using an argument entirely similar to that of 
Theorem 1 6 . 2 1 we find 

Theorem 6.3. If J is a linear transformation such that Tf < f for alii £ P then there 
are constants a, b, c where a and c have the same sign such that 

Tf = axf + bf + cf ' 

Remark 6.4. If we restrict the domain then we can find transformations that 
are not the derivative that satisfy Tf < f. Here are three examples: 

T(f) = (x 2 -l)f onP ( - u) 
T(f) = D((x 2 -l)f) onP hul 

yl+l 

T(x n ) = onEXP(P) 

n + 1 

We can generalize Theorem 16.11 by restricting the domain of the transfor- 
mation to polynomials of degree n. Under these restrictions the polar deriva- 
tive makes an appearance. 

Theorem 6.5. Fix a positive integer n and suppose that T is a linear transformation 
that maps homogeneous polynomials of degree n to homogeneous polynomials of de- 
gree n — 1 with the property that F < T(F) for all f e P{n)for which F(x, 1) = f(x). 
There are b, c where be > such that 

7(F) 

ox oy 

Proof. Since T is continuous we observe that if F is the limit of polynomials of 
degree n then F and TF interlace. Considering that 

limx r (l + ex) n - T = x T 

we conclude that x T and T(x r ) interlace for all r ^ n. Thus there are a r , b r , c r 
such that 

T(x r ) =x r_1 (a r +b r x + c r x 2 ) 

Since T(x r (x — a) 2 ) is interlaced by x r (x — a) 2 , it has a, for a root, and we find 

=a r+2 (a r+2 + b r+2 a + c r+2 a 2 ) 
— 2a r+1 (a r+ i + b r+ ia+ c T+ ia 2 ) 
+ a T (a T + b T a + c T a 2 ) 
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Since this is true for all a we find that 

= a r+2 - 2a r+ i + a r 
= b r+2 - 2b r+1 + b r 
= c r +2 - 2c T+ i + c T 

Solving these recurrences yields a r = A + Air, b T = B + Bir, c r = Co + Qr 
which implies 

T(x r ) = (Ao + Airjx 1 "- 1 + (B + Bir)x r + (C + Cir)x T+1 

Since T(x n ) is a polynomial of degree n— 1 we find Bo+Bin = and Co+Cjn 
so that 

T(x r ) = (A + A 1 r)x T - 1 + B 1 (r-n)x r + Ci(r-n)x r+1 

Since 

Kx^-^x + l)) = (Ao + AmJfx^J + fAo + Aifn-lJjx 11 - 2 - 
Bjx 1 ^ 1 - Cix n 

has degree n — 1 we see that Ci = 0. It remains to see that Ao = 0. If not, we 
may assume Ao = 1 and consider 

T((x-a) 3 ) = (x 2 -3xa+3) +3Ai(x-a) 2 -Bi(-3x 2 a + 6xa 2 -3a 3 ) 

whose discriminant is 3a 2 (—1 — 4Aj — 4Bia). Since a is arbitrary this can 
be negative unless Bj = 0. Thus T(x T ) = x T_1 + Airx r_1 . We see that TF 
has degree r — 1 so F < TF. The proof of Theorem 16.1 1 shows that this can not 
happen, so A is 0, and T(x r ) = Ajfrx 1 " -1 ) + B^r — n)x r which implies the 
conclusion by linearity. □ 

If f is a polynomial then f (D) (x + a) n is a sum of constants times powers 
of x + a. The next lemma is a converse. 

Lemma 6.6. Suppose!: P — ► P satisfies 

k 

T(x + a) k =^a k ,i (x+a) 1 

i=0 

for all a € R, where a^t are constants not depending on a. Then there is anfeP 
such that! = f(D). 
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Proof. Continuity of T implies that D and T commute: 



TD(x+a) k =Tlim 



(x + a + e) k — (x + a) k 



o e 
_ j. Li «k,t(x + a + e) 1 - Y.j «k,i(x + a) 1 
e^O e 

= Y_ <*Ki(* + a) 14 " 1 = DT(x + a) k 

Now, any linear operator commuting with differentiation is a power series 
f(x) in D. Since f (D) maps P to itself, it follows from Theorem 15.361 that f e 
P. □ 

The proof of the following lemma (due to Carncier-Pinkus-Pena|30[) 
clearly shows the significance of the monic hypothesis. The proof reduces 
to the fact that we know all the polynomials interlacing (x + l) n . 

Lemma 6.7. If J : P(n) — ► P(n) satisfies 

1. T(x k ) = x k + terms of lower degree for < k ^ n. 

2. T(x k ) G P vos forO^ k n. 
then there is a polynomial f such that 

1. T =f(D) 

2. EXP(f) G P 

Proof. Let T(x k ) = Lo a Ki x ° where = 1. We first describe the argument 
for n = 3. Consider T* (x + y ) 3 : 



y 3 

3a 2y 2 3xy 2 

4a iy 6a u xy 3x 2 y 

Qoo a-w'x- q 20 x 2 x 3 
The monic hypothesis implies that the rightmost diagonal is (x+y ) 3 . Since 
T* (x + y ) 3 G P2° s the diagonals interlace, so the adjacent diagonal is a multiple 
of (x + y) 2 , and so on. Thus T*(x + y) 3 = 

y 3 

a 2 oy 2 3xy 2 

aioy 2a 2 oxy 3x 2 y 

Qqo QIOX Q2QX 2 x 3 
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If f = ^bix 1 then f(D) (x + y) 3 = 

3y 2 bi xy 2 b 

6y b 2 6xy bi x 2 y b 

6b 3 6xb 2 3x 2 bi x 3 b 
Equating coefficients shows that if g — aoo + Qio x + Q2ox 2 + Q3ox 3 then 

T^^exp- 1 g Tev )(D) 

In general, the rightmost diagonal of T* (x+y ) n is (x+y ) n , so all the parallel 
diagonals are powers of x + y . Thus 

n 

T,(x + y) n ^^Qiofx + u) 1 

1=0 

If g = Y.i=o a i0 x v then 

(^exp" 1 g rev )(D)(x + ii) Tl =^^^a n _ w D l )(x + y) n 

n 

^^Qiofx + y) 1 = T*(x + y) n 

i=0 

□ 

Affine transformations are the only transformations whose inverses also 
preserve roots. 

Theorem 6.8. If a degree preserving linear transformation and its inverse both pre- 
serve roots then the linear transformation is an affine transformation. More precisely, 
if! is a linear transformation such that 

• T(P) CP 

• T-^P) c P 

• deg(T(x n )) =n 

then there are constants a, b, c such that Tf (x) = cf (ax + b). 

Proof. Using the three constants, we may assume that T(l) = 1 and T(x) = x. 
With these normalizations, it suffices to show that Tf = f . Given a polynomial 
f S P, we can use Theorem II, 431 to apply T _1 to the interlacing Tf < (Tf)' to 
find that f <T _1 (Tf)'. Since T~ : (Tf)' is a linear transformation defined on all 
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of P, there is a constant w such that T _1 (Tf)' = wf, or equivalently wT(f') = 



We now can show that T(x 2 ) = x 2 . Assume that T(x 2 ) = ax 2 + |3x + y. 
Since 



we find that T(x 2 ) = wx 2 + y. Solving for T _1 gives T _1 (x 2 ) = (1/w) (x 2 — y). 
If y is non-zero, then it is not possible that both T(x 2 ) and T~ x (x 2 ) have real 
roots, so T(x 2 ) = wx 2 . Since T((x — l) 2 ) = wx 2 — 2x + 1 has all real roots, 
the discriminant is non-negative which implies that w 1. Similarly, since 
T _1 (x 2 ) has all real roots then (1/w) ^ 1, and so w = 1. 

If we assume by induction that T(x n_1 ) = x n_1 then applying T(f') = (Tf)' 
shows that T(x n ) = x n + oc. This polynomial does not have all real roots unless 
oc = 0, soT(x n ) = x n and thus Tf = f . □ 

Remark 6.9. There is an alternate argument for the end of the proof. If T 
commutes with differentiation D then T = f(D) for some f € P. Similarly 
T _1 = f _1 (D), so f is invertible and has no zeros. The only functions of this 
form in P are f (x) = ae bx , and ae bD g(x) = ag(x + b). 

Corollary 6.10. If T is an invertible linear transformation such that TP = P then 
Tf = cf(ax + b). 

Remark 6.11. Although the only bijections on P are trivial, there are non-trivial 
bijections on P alt . If Ln"' (x) is the monic Laguerre polynomial, (§ 1717.1011 then 
1 146] the linear transformation x n i— > L"M satisfies T = T _1 . Since T maps 
P alt to itself (Corollary |7.47|| it follows that T is a bijection on P alt . 

Lemma 6.12. Suppose that T is a degree preserving linear transformations from 
polynomials to polynomials. If f and Tf have a common root for all f then Tf is a 
multiple ofi. 

Proof. We can assume that T(l) = 1, and let T(x n ) = a n) iX l . Since T(x n ) 
and x n have zero for a common root the constant terms a n o are zero for n > 0. 
Next, x + 1 and T(x + 1) = ai ix + 1 have — t for a common root, so ai i = 1 and 
T(x)=x. 

Now assume that T(x k ) = x k for k < n. 



Since — t is the only root of (x + t) n we see that T(x n )(— t) = (— t) n for all t 



(Tf)'. 



T(x 2 )' = 2ax + (3 
wT(2x) = 2wx 




n-l 



and therefore T(x n ) = x n . 



□ 



If f and Tf have a common interlacing then Tf has a simple form. 
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Lemma 6.13. If J is a degree preserving linear transformation such that Tf and f 
have a common interlacing for alii e P then there are a, |3, y so that 

T(f) = (ax + |3)f'+yf 

Proof. Since x n_1 is the only polynomial that interlaces x n , we see that if n ^ 2 
then T(x n ) has a n — 1 fold root at zero. Write 

T(l)=b 
T(x n ) = a n x n + b n x n ^ n > 1. 

Now we use the fact that T(x — t) n must have a root at t if t ^ 0. For n = 2 
this means that 

= T(x - t) 2 [t] = T(x 2 )[t] - 2tT(x)[t] + t 2 T[l] 

which implies 

T(x 2 ) = 2xT(x) -x 2 T(l) 

Thus a 2 = 2ai — b and b 2 = 2b \. We now follow the same argument, and 
show by induction that 

a n = nai - (n- l)b b n = nb n _i 

which implies that 

T(x n ) = x n (naj - (n - l)b ) + Tuc n_1 bi 

= a lX (x n )' - b (x(x n )' - x n ) + b!(x n )' 

and by linearity 

T(f ) = ai xf ' - b xf ' + b f + bi f ' 
= ((ai-bo)x + bi)f' + b f. 

□ 

Compositions mapping P to itself are very simple. 
Lemma 6.14. Suppose g (x) is a polynomial. 

(1) Ifi (g(x)) e P for alii e P then g is linear. 

(2) If f(g(x)) e P for all f e P pos fen g is either linear, or is quadratic with 
negative leading coefficient. 

(3) Jfg(x) + c e P for all ceK f«en g is Zmear. 

(4) if g(x) + c e P/or all c> then g is quadratic with negative leading coefficient. 
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Proof. If we take f to be linear, then (1) and (2) follow from (3) and (4). If the 
degree of g is odd then the horizontal line y — — c will intersect the graph of 
g(x) in only one point if c is sufficiently large. Thus, in this case the degree of 
9 is 1. 

If the degree of g is even then the horizontal line y — — c with c large 
and positive will intersect the graph of g in at most two points, and hence 
the degree of g is two. Since these horizontal lines are below the x-axis, the 
parabola must open downward, and so the leading coefficient is negative. □ 

Here is another characterization of affine transformations. 

Lemma 6.15. Suppose that T: P — > P is a linear transformation. T distributes 
over multiplication ( T(f • g ) = Tf ■ Tg ) if and only ifT is an affine transformation. 

Proof. If p = T(x) then by induction T(x n ) = p n . Since T(x + c)=p + ceP 
for all ceR, we see that p is linear, so write p = qx + b. If f (x) = Y. Q i xl then 
the conclusion follows from 

□ 

The next lemma describes those transformations whose translations also 
preserve roots. 

Lemma 6.16. IfT is a linear transformation on polynomials such that 

1. (T + a)P C P for all aeR. 

2. deg(T(x n )) =n 

then there are constants c, d such that Tf = cf + df. 

Proof. For any polynomial g 6 P, (T + a)g = Tg + ag € P. By Proposition |1.35l 
Tg and g interlace. The conclusion now follows from Theorem l6.2l □ 

A similar argument shows 
Lemma 6.17. IfT is a linear transformation on polynomials such that 

1. (T + a)P alt c Pfor all a e K 

2. deg(T(x n )) =n 

then Tf (x) = cf + (d + ex)? for certain c, d, e € K. 

It is difficult to tell when two linear transformations T, S have the prop- 
erty that Tf and Sf interlace for all f € P, but it is easy if T, S are multiplier 
transformations. Such transformations map x n to a constant multiple of x n . 

Lemma 6.18. Suppose T(x n ) = t n x n and S(x n ) = s n x n both map P to itself. 
Assume that t n zs always positive and | t * n i \^ + \ \ ^ 0. Jf Tf and Sf interlace for all 
f G P then there are a, b e R so that Sf = + aTf + bx(Tf )'. 
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Proof. We use the same approach as we have been following, and choose a test 
polynomial f such that Tf has a multiple root - this root must also be a root of 
Sf, and this will give us a relation between T and S. We let f = x n (x — a)(x— 1) 
so that 

Tf(x] = (atn-d+a) xt 1+n + x 2 t 2+n )x n 
Ignoring the factor of x n , the discriminant of f is 

A = (-l-a) 2 t 2 +n -4at n t 2+n 
so if we choose a to make A zero 



2 

2+n 



-t 2 +n + 2 t n t 2+n -2 J- (tn t\ +n t 2+n ) + t* t 
Q = 7 

then we can solve Tf = with this value of a. The hypotheses on t n ensure 
that Q£l. The root is 



tn t 2+n — J— (tn t 1+n t 2+n ) + t n t 2+n 
T = 

t l+nt 24 

and if we compute (Sf)(r) we get 

/ S 2+n t n ti +n + (—2 Si + n t n + S n tl+nj t24 







ti +n — 2 t n t 2+n + 2 y t n t 2+n [— t 1+n + t n t 2+1 

If the second factor is zero then simple algebra shows that t n = 0, contradict- 
ing our hypothesis. Consequently 

= S 2+n t n tl+n + (—2 Sl + n tn + S n tl+n) t 2+n 

If we define u n = s n / t n then this relation simplifies to 

= U n+2 - 2u n+ l + U n 

This implies that u n = a + bn and hence s n = (a + bn)t n . It follows easily 
thatSf = aTf + b(Tf)'. 

□ 

The transformation x n i— * ^f- was shown in B142I to map polynomials 
whose imaginary part is 1/2 to polynomials whose imaginary part is 1/2. Such 
a transformation can not map P to itself, as the next lemma shows. 
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Lemma 6.19. Suppose that T maps polynomials with complex coefficients to polyno- 
mials with complex coefficients. Suppose that I_i and L2 are intersecting lines in the 
complex plane with the property that T: P Ll — ► P Ll and T: P Ll — ► P Ll . Then, T 
is a shifted multiplier transformation. That is, if a = Li Pi L2 then there are constants 
a n such that 

T(x — oc) n = a n (x — a) n for all integers n. 

Proof. All polynomials in P Ll nP 1-2 are constant multiples of powers of (x — a). 
Since T maps P Ll n P Lz to itself, the conclusion follows. □ 

If the transformation considered in [142] also mapped P to itself, then T(x— 
l/2) n would be a multiple of (x — l/2) n , for all n. This is false for n = 2. 



6.2 The domain and range of a linear transformation 

If T is a linear transformation that maps P 1 to P^ where I, J are intervals, then 
there are restrictions on the kinds of intervals that I, J may be. Up to affine 
transformations there are three types of intervals: all of R, half infinite inter- 
vals (e.g (—00, 1)), and finite intervals (e.g. (0, 1)). The only restriction in the 
next lemma is that T is non-trivial - that is, the image of T is not all constant 
multiples of a fixed polynomial. 

Lemma 6.20. If J is an interval, and T : P R — > P J is non-trivial then J = R. 

Proof. Suppose that T: P — ► P J . Choose f <§; g such that Tf and Tg are not 
constant multiples of one another. Since f and g interlace, so do Tf and Tg, 
and hence Tf + aTg 6 P J for all aeR. 

For any r that is not a root of Tg the polynomial h = Tf + jf^ffi Tg has r 
for a root, h is not the zero polynomial since Tf and Tg are not constant multi- 
ples of one another. Consequently there are polynomials in with arbitrarily 
large and small roots, so J must be R. 

□ 

Table 16.21 shows that all cases not eliminated by the lemma are possible. 
For instance, the reversal map f 1— * f rev satisfies P (1 '°°' (n) — ► p(°-H_ 



all of 1 



half infinite finite 



all of R 
half infinite 
finite 



x l 1 ► Hi 






X 1 h-> (-ljl^x 1 


X 1 1 ► (x)i 


f 1 — > f rev 


X 1 q Hi 


x l 1 ► Ai 


x 1 1 ► Ti 



Table 6.1: Realizing possible domains and ranges 



If T: P la ' bJ (n) — ► P then we can easily find upper and lower bounds for 
the range of T. 
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Lemma 6.21. If T: P la,bl (n) — ► P(n) and preserves interlacing then 
T: P [a,bl (u) — ► P [r,s ](n) a;/zere s is f/ze largest root o/T(x — b) n and r is tTze 
smallest root ofT{x — a) n . In addition, for any f G P (a ' b) (n) i(;e fcaue 

T(x-a) n T(f) ^T(x-b) n 

Proof. Choose f € p (a ' b ' (n). If we join (x — b) n and (x — a) n by a sequence of 
interlacing polynomials containing f (Lemma ll.64t then applying T yields 

T((x-b) n )< ■■• <Tf< •■• <T(x-a) n 

This shows that the largest root of T(x— b) n is at least as large as the largest 
root of T(f ), and the smallest root of T(x — a) n is at most the smallest root of 
Tf. □ 

If f < — g then the k'th root of f is bounded above by the k'th root of g, for 
all k up to the degree of g. Here's a simple consequence of this idea. 

Lemma 6.22. Suppose that T : P — ► P preserves degree and the sign of the leading 
coefficient. If I G P[n) and a k is the k'th largest root off (1 ^ k ^ n) then 

Y'th largest root o/T(f ) ^ k'th largest root ofT{ (x — cik) k ) 

Proof. Suppose roots (f) = (at) and define the polynomials 

g T = ( x - ai)(x- a 2 ) ••• (x- a T ) 

h r = (x- a r )(x- a r +i] ■ ■ • (x- a k ) r (H r ^ k) 

We have interfacings 

f = 9n < Qn-i < ■ ■ ■ < 9k = hi < h-2 • • • <H k = (x- a k ) k 

Since T preserves the direction of interlacing, and the degree, 

T(f) < • • • < T(g k ) = T(hi) < ■ • • <T((x- a k ) k ) 

The conclusion now follows from the observation above. □ 

If we apply this kind of argument to the derivative, then we can use the 
extra information that f < f . When we replace h T with 

h r = (x-a T )(x-a r+ i)---(x-a k ) k (x-a k+ i) 

then a similar argument shows 

Lemma 6.23 (Peyson llT36l ). Iff G P(n) and 1 ^ k < n then the k'th root of f ' lies 
in the interval 

. Clk+l — u k a k+ i — Q k 

Qk H : — — r- / Uk+1 : — — : 

n — k + 1 k+1 
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We can integrate the image of a linear transformation over an interval if 
the images of the endpoints interlace. 

Lemma 6.24. J/T: P (a b) — ► P and 

1) T preserves interlacing 

2) T(x-b) n <T(x-a) n n = l,2,--- 



then 



b 



T(x-tl n dtgP 



Proof. From Lemma [1.641 we can join (x — b) n and (x — a) n by a sequence of 
interlacing polynomials that contains f and g. Since T preserves interlacing 
we know that 

T(x - b) n < • • ■ <T(g) < ■ • ■ <T(f) < ■ • • <T(x - a) n 

Now the endpoints of this interlacing sequence interlace, so it follows that 
the sequence is a mutually interlacing sequence. In particular, we know that 
T(x — t) n is a family of mutually interlacing polynomials on (a, b), so the 
conclusion follows from Proposition 13 .371 □ 

It is not easy to have a finite image. Some multiplier transformations have 
finite images. For instance, the identity is a multiplier transformation. Less 
trivially, the map f i-» xf maps p' 0,1 ' to itself, and is the multiplier transfor- 
mation x 1 i—* ix 1 . There's a simple restriction on such transformations. 

Lemma 6.25. Suppose!: x v i— ► cux 1 maps P^ to itself. Then 

a Q ^ ai < ci2 ^ • • • 

Proof. We may assume that all at are positive. Consequently, the roots of 
T( x n - x n - 1 ) lie in [0, 1], and so 

< T( X n - X n - 1 = (a n X n - Qn-jX— ^(l) = On - Qn-l. 

□ 

If the constants decrease sufficiently rapidly, then no finite interval can be 
preserved. 

Lemma 6.26. Suppose that the multiplier transformation T: x 1 i— > a^x 1 maps P v ° s 
to itself, and limsup |a n | 1/n = 0. Then there are no finite intervals I, J such that 
TfP 1 ) c P J . 

Proof. It suffices to show that the absolute value of the largest root of T ( x + 1 ) n 
goes to infinity as n — > oo. If T(x + l) n = a n x n + ■ ■ ■ + qq, then the product 
of the roots is ao/a n . Since there is a root of absolute value at least |ao/ a n | 1/n , 
the conclusion follows since |a n | 1/n goes to 0. □ 
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For instance, since limsup (l/n!) 1/n = 0, it follows that the exponential 
transformation x n i— > x n /u! does not preserve any finite interval. 

Remark 6.27. By Corollary ll.501 the leading coefficients of a linear transforma- 
tion that preserves degree are either all the same sign, or they alternate. If T al- 
ternates the signs of the leading coefficients, and we define S(f ) (x) = T(f )(— x) 
then S preserves the sign of the leading coefficient. Thus, we usually only con- 
sider linear transformations T : P — ► P that preserve the sign of the leading 
coefficient. 

Table 16.21 shows that there are many possibilities for the degrees of 
T(l),T(x),T(x 2 ),T(x 3 ),.... Since x n + ax n -! e P we know T(x n ) andTtx^ 1 ) 
interlace, and so the degrees of T(x n ) and T(x n_1 ) differ by at most one, pro- 
vided neither is zero. 



Transformation degree of T(l), T(x),T(x 2 ), . 

gn> fg n,n + l,n + 2,n + 3, . . . 

g i ^ f*g 0,l,...,u-l,n,0,0,0,... 

g^ D k f(D)g 0, .. .,0, 1,2,3, .. . 

g i > g(D)f n,n-l,.. .,3,2,1,0,0,... 

x k c k f+d k f' 0,norn — 1 



Table 6.2: Degrees of a linear transformation where f € P(n). 



It is important to note that the transformation T : g i— ► g(D)f maps P — ► P, 
yet it does not preserve the sign of the leading coefficient, nor do the signs 
of the leading coefficient alternate. Instead, the sign of the leading coeffi- 
cient of T(g) depends on g, for it is the sign of g(0). This doesn't contradict 
Lemma [l .501 since T doesn't preserve degree. 

The last transformation maps x k to a linear combination of f and f ', which 
might be constant, so the degree is 0, n or n — 1 . 

6.3 Transformations determined by products 

When does a linear transformation of the form 

TL 

T(x n ) =n(x+aO (6.3.1) 

i=i 

preserve real roots? We can not answer this question but we can show that 
such a non-trivial T can not map P to itself. Next we will observe that if T in 
1 16.3.11 maps P pos to P then there are constraints on the parameters ai. There 
are examples of such root preserving transformations, for in Corollary l7.35l we 
will see that the choice ai = i — 1 maps P pos to itself. 



Lemma 6.28. If J is given in 1 16.3. It and T maps P to itself then all at are equal. 
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Proof. Choose beM, assume T preserves roots, and consider 

T((x - b) 2 ) = b 2 - 2b(x + at) + (x + aiHx + a 2 ) (6.3.2) 

The discriminant is 

— (a.2 — ai)(4b + ai — a2) 

If ai ^ ci2 then we can choose b to make the discriminant negative, so such a 
linear transformation T does not preserve roots. More generally 

T 

T(x r (x-b) 2 ) =n(x+aO • (b 2 -2b(x+a T+ i) + (x+a r+ i)(x+a r+2 )) 

(6.3.3) 

We can apply the same argument as above to conclude that a T+ i = a r +2/ and 
hence if T preserves roots then all cu are equal. □ 

Note that the conclusion of the lemma implies that there is an a such that 
T(f)=f(x+a). We restate the conclusion: 

Corollary 6.29. IfT:P — ► P maps monk polynomials to monk polynomials, and 
T[x l ) divides T(x 1+1 )/or all i ^ then T(x x ) — (x + a) 1 for some a. 

Lemma 6.30. J/T is given in \63.l\ andT:P v ° s — > P then 

dl ^ Q2 ^ (12 ^5 ■ • • 

Moreover, if a n+ i — a n for some n them a m = a n for all m ^ n. 

Proof. Choose negative b so that T ( ( x — b ) 2 ) is in P . If we choose | b | sufficiently 
large in (|6.3.2b then the sign of the discriminant is the sign of Q2 — ai and hence 
ai < d2- The same argument applied to l|6.3.3b shows that a T+ i < a r+ 2. 
Next, suppose that a n +i = a n . If f = (x + a) 3 x n_1 then 

n+2 n+1 Ti n-1 

T(f ) = (x + at) + 3a JJ (x + at) + 3a 2 ]^[(x + at) + a 3 ]J (x + at) 

11 11 

n-1 

= J^(x + at) x 

1 

(x + a n+2 )(x + a n+1 )(x + a n ) + 3a(x + a n+1 )(x + a n ) + 3a 2 (x + a n ) + a 3 

and without loss of generality we may set a n = a n+ i = 0. The resulting cubic 
polynomial 

(x + a n+ 2)x 2 + 3ax 2 + 3a 2 x + a 3 

is in P for all positive a. If we substitute a = a n +2 then the roots are (—.42 ± 
.35i)aTt + 2 and — 3.1a n+ 2. Consequently, an+2 = 0. We can now continue by 
induction. 

□ 
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For other examples of such transformations: see Lemma [12. 261 
6.4 Composition 

If T is a linear transformation, and f , g are polynomials then we can construct 
a new polynomial f (T) g. In this section observe that such composition is ubiq- 
uitous, and give conditions for f (T)g to define a map P 1 x P — > P. 

Lemma 6.31. If J is a linear transformation that preserves degree then there is a lin- 
ear transformation W and a constant c so that T(f ) = cf (W) [l)for every polynomial 
f. 

Proof. Since T preserves degree the polynomials T(x°), Tfx 1 ), T(x 2 ), . . . form a 
basis for the space of all polynomials. Define the linear transformation W by 
W(T(x 1 )) = T(x 1+1 ), and set c = T(l). We show by induction that for this 
choice of W and c we have Tf = cf(W)(l). It suffices to establish this for the 
basis above. We note that T(x ) — c — cW°(l). The inductive step is 

T(x n+1 ) = W(T(x n )) = W(cW n (l)) =cW n+1 (l). 

□ 

We can also define W by W(Tf ) = T(xf ). Since T is invertible we also have 
that W(g) = T(x(T _1 g)) This representation shows that if T is the identity 
then W is multiplication by x. The inverse of a transformation expressed as a 
composition has an inverse that is easily expressed as a composition. 

Lemma 6.32. IfW is a linear transformation that increases degree by one, T(l) = 1, 
andT{i)=i(W)[l)then 

T _1 (f) =f(T- 1 WT)(l) 

Proof. It is enough to verify the conclusion on a basis, and a good choice for 
basis is {W n (l)}. We note 

T _1 W n (l) = (T- 1 WT) n T- 1 (l) = (T^WTHl) 

□ 

We now look at some general properties of compositions. 

Theorem 6.33. Suppose that the linear transformation S has the property that (S — 
oc)h £ P for allh £ P and all a in an interval I. The bilinear map fxgH f (S)g 
defines a bilinear transformation P l x P — > P. 

Proof. We may write f = (x — ai) • • • (x — a n ) where all ai are in I. Since 
T(g)=f(S)g = (S-ai)---(S-a n )g 



we see that the hypothesis on S guarantees that Tg £ P. 



□ 
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A similar argument yields 

Corollary 6.34. Suppose that!: P v ° s — > P v ° s and for anyfeP v ° s we have that 
f < — Tf. The mapix g ^ f (T)g maps P v ° s x P v ° s to P pos . If g e P pos is fixed 
the map f f(T)g preserves interlacing. 

6.5 Recurrence relations for orthogonal polynomials 

All orthogonal polynomials satisfy a three term recurrence \6.5.1\ . We use 
this recurrence to establish recurrence relations for linear transformations 
based on orthogonal polynomials. We also get recurrence relations for the 
inverses of these linear transformations. Finally we specialize to polynomials 
of Meixner type. 

Assume that pi,P2, ■ • • is a sequence of orthogonal polynomials satisfying 
the three term recurrence 

Pn+1 = (in" + b n )p n + C n Pn-l (6.5.1) 

and define the linear transformations 

T : x n p n 

A : x n i — > a n x n 
B : x n i— > b n x n 
C : x n h-> c n x n - 1 

From the recurrence we find that 

T(xx n ) = d n xp n + b n p n + C n pn-1 

= x T(A(x n )) + T(B(x n )) + T(C(x n )) 
Since this holds for all n, and T is linear, we have for all polynomials f 

T(xf) = xTA(f) + TB(f) + TC(f ) (6.5.2) 
Similarly we have a recurrence for the inverse of T 

T _1 (xp n ) = T _1 ((l/a n )Pn+l - (bn/QnlPn - (Cn/Qn)Pn-l) 

= (l/a n )x n+1 - (b n /a n )x n - (Cn/ajx^- 1 

= xA^T-^Ptv) - BA^T-^Pn) - CA^T-^pn) 
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and consequently we have for all polynomials f 

T- 1 (xf ) = xA^T" 1 (f ) - BA _1 T _1 (f ) - CA^T" 1 (f ) (6.5.3) 
= (xA^ 1 - BA" 1 - CA" 1 ) T -1 f 

If T(l) = 1 then we have an explicit composition representation of T~ : 

T-^f) = f(xA~ : - BA~ : - CA _1 )(1) (6.5.4) 

Notice that this last representation is not well defined. However, we take 
116.5.41 to mean that we do not multiply f out, and then substitute, but rather 
we use the product representation that follows from 16.5.31 . 

These recurrences are too general to be useful. If our goal is to establish 
that a certain T preserves interlacing, then we must have that T(xf) <cT(f). 
This necessitates that there is some sort of interlacing relationship between 
A, B, C. We have seen that if A is the identity then it's probable that B, C are 
functions of derivatives. There is a well known class of polynomials for which 
this is true. 

The Meixner class of orthogonal polynomials [68 page 165] consists of 
those orthogonal polynomials p n that satisfy a recurrence 

Pn+i = (x - a - an)p n - (bn + 6n(n - l))p n -i (6.5.5) 

Upon specializing (|6. 5.31 1 and ^6.5.4) . where A is the identity, B = — (a + 
axD), C = — (bD + (3xD 2 ), we find the recurrences 

T(xf) = xTf - T( (a + axD + bD + 6xD 2 ) f) (6.5.6) 
T- : (f) =f(x+a+ctxD + bD + 6xD 2 Ml) (6.5.7) 

We have seen several examples of orthogonal polynomial families. Al- 
though /amz'Zies of orthogonal polynomials have many special properties, there 
is nothing special about any particular orthogonal polynomial, as the next the- 
orem shows. 

Theorem 6.35. Every f 6 Pis an orthogonal polynomial. In other words, there is 
a measure that determines an orthogonal family of polynomials, and f is a member of 
that family. 

Proof. Assume f S P(n). We will show that there is a sequence of polynomi- 
als Po/Pi/ • • ■ / Pn = f where the degree of pt is i, and there are non-negative 
constants at such that pt+i = (atx + bt)pt — Pi-i for 1 ^ i < n. It follows 
from Favard's Theorem [ 168 1 that the sequence po, . . . , p n is a set of orthogonal 
polynomials determined by a measure. 

We construct the pt inductively. Assume that f has positive leading coeffi- 
cient. Let p n = f, and p n -i = f. Since p n <p n -i we can find a p n -2 such that 
Pn = (a n x + b n )p n _i — Pn-2- Since p n -i < Pn-2 we continue inductively. □ 
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6.6 Linear transformations satisfying a recursion 

If a linear transformation satisfies a recursion then we can sometimes conclude 
that the transformation maps P alt or P pos to itself. 

Theorem 6.36. Suppose that linear transformations A, B, C, D, E, F, H map poly- 
nomials with positive leading coefficients to polynomials with positive leading coeffi- 
cients. Suppose that for all f £ p pos we have 

Af<f<Bf Cf<f< — Df Ef<f< — Ff Hf<f (6.6.1) 

then the linear transformation defined by 

T(xf) = ( HT + xTB + T(C - D) + (E - F)T ) f 

maps P v ° s to itself and preserves interlacing. If the interlacing assumptions l |6.6.11 
hold for all f e P alt then 

S(xf ) = ( HS + xSA + S(C - D) + (E - F)S ) f 

maps P alt to itself and preserves interlacing. 

Proof. We prove the theorem by induction on the degree of f , so assume that T 
maps P pos (n) to itself and preserves interlacing. By Corollary II. 461 it suffices 
to show that T(xf) <Tf for all f e P pos (u). If f e P pos (u) then f <Bf and 
so Tf TBf. Now all roots of TBf are negative, so xTBf < Tf. Next, we apply 
our assumptions d6.6.1| > to the polynomial Tf S P pos (n) and find 

HTf < Tf 

TCf<Tf Tf<TDf 
ETf<Tf Tf<FTf 

Finally, use Lemma [1.3 II to conclude 

T(xf) = ( HT + xTB + T(C - D) + (E - F)T ) f < Tf 

and hence T(xf ) < Tf which implies that T maps P pos (n + 1) to itself and pre- 
serves interlacing. The case for f £ P alt is similar. □ 

6.7 Mobius transformations 

A Mobius transformation determines a linear transformation on polynomials 
of fixed degree, and also an action of the space of all linear transformations. 
Recall that a Mobius transformation (or linear fractional transformation) is a 
transformation of the form 



(6.7.1) 
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To construct a map between polynomials we restrict ourselves to polynomials 
of a fixed degree n, and homogenize: 



M : x l (ax + b) l (cx + d) T 
We can express M in terms of M: 

M(f) = (cx+d) n f(Mx). 



(6.7.2) 



We have already seen an example of a Mobius transformation in d4.1.4> . If 
we take M(z) = 1/z then M(f) = z n f(l/z) which is the reverse of f. If I is an 
interval, then we define M(I) to be the image of I. With this notation, we have 
the elementary lemma 



Lemma 6.37. If Mis a Mobius transformation j6.7.2i then M is a linear transfor- 
mation that maps P l (n) to P M 1 ( 1 ' (n) bijectively. 

Proof. If f e P 1 thenM(f) = (cx+d) n f (Mx). Arootrof M(f) satisfies f( Mr) = 

so r =_M~ : s where s is a root of f, and thus s e I. The inverse of M is M -1 

□ 

P J and 



and so M is a bijection. 

Lemma 6.38. If I, J are intervals, and T is a linear transformation T: P 1 



a e R then the linear transformation S(x x ) = ot x T(x x ) maps P I/a 

agi 

pi/ « '' • pi 



Pl 



Proof. The result follows from the commuting diagram 

s 




□ 



The next lemma shows that only the type of interval matters. 



Lemma 6.39. Assume I and J are intervals, and that K\ and A2 are affine transfor- 
mations. There is a 2-2 correspondence between linear transformations mapping P 1 
to P J and linear transformations mapping P* 11 to P^^ 

Proof. The correspondence is given by the diagram 

P i ^ P J 



> pi« 



□ 



As another application, the next lemma shows two linear transformations 
that are related by a Mobius transformation. 
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P alt (n) > P alt (n) 

P (o,i) (n) 

Figure 6.1: A Mobius transformation acting on a linear transformation 

Lemma 6.40. Choose a positive integer n, and define linear transformations T{x x ) — 

( X+ n 1 )' S( xi ) = (i)' an ^ the Mobius transformation M(z) = Then T = SM 
on P(n). 

Proof. It suffices to verify that T = SM for a basis of P(n), where M(x x ) — 
x x {x + l) n ~\ so consider 

SMfx 1 ) =S(x i (l+x) n - i ) 

= s (S(tH 

= §(^-(^ + i))G + i) 

and the last expression equals T(x x ) using the Vandermonde identity 




□ 

We will later show that S : P alt — ► P. M maps (0, 1) bijectively to (0, oo), 
so we have a commuting diagram (Figure \6?7\ of maps that preserve roots. 

The second action determined by a Mobius transformation is in the space 
of linear transformations. Let T be a linear transformation that preserves de- 
gree. If M is given in | |6.7.1| | then we define a new linear transformation by 

T M (x r ) = (cx + d) T (T(x r ))(Mx) (6.7.3) 
If T(x T ) = p r then we can write this as 

cx + d 
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Note that T M ^ MT, since onP(n) 

MT(x r ) = (cx + d)> r (^) 
cx + d 

Ifpr(O) ^ then both T(x r ) and T M (x r ) have degreer. 
We first note that this action respects composition. 

Lemma 6.41. IfM, N are Mobius transformations then (Tm )n = Tn m- 

Proof. The proof follows from the commuting diagram 




□ 

Consider some examples. 

1. If T is the identity then T M (x n ) = (ax + b) n . Equivalently, T M (f) = 
f(ax + b). 

2. If T is any degree preserving linear transformation, Mz = az + b, and 
we let T(x n ) = p n then Tm (x n ) = p n (<ix + b). Consequently Tm (f ) = 
(Tf)(ax + b). Moreover, if I, J are intervals and T: P 1 — ► then Tm : 
pM-'i y pj 



3. The most interesting (and difficult) case is when the denominator is not 
constant. Because of the composition property (Lemma I6.41L and the 
examples above, we only consider the case M(z) = 1/z. Since x n f (1/x) 
is the reverse of f when f has degree n we see that 

T 1/z (x"-)= T(x-) REV . 

For example, suppose that 

T(x n ) = (x-l)(x-2)---(x-n). 



Applying M(z) = 1/z yields 

T Vz (x n ) = (l-x)(l-2x)---(l-nx) 



CHAPTER 6. LINEAR TRANSFORMATIONS OF POLYNOMIALS 



160 



Example 6.42. We can combine Mobius transformations with other root pre- 
serving linear transformations. For instance, we will see (Corollary l7.35b that 
(x) n ^ * n ma P s paU to itself and (LemmaO x k h-> ( a + n ) v x k maps P alt to 
itself. If M(z) = z/(l — z) the composition 

P««(n) ( ^ . P^(n) >k - k P»«(n) * P t«W 

is the linear transformation (x) k i— * ( q + n ) v x k (l — x) n ~ k and maps P alt (n) 
to P (ai) . This is Example 1 in ED. 



6.8 Transformations and commutativity properties 

We first characterize transformations that commute with x i— > ax, and those 
that commute with x i— ► x + 1. We also characterize some transformations 
defined by commuting diagrams. Finally, we list some diagrams that are sat- 
isfied by various linear transformations. 

Lemma 6.43. Suppose that the linear transformation T preserves degree. Then T 
commutes with xh« where a ^ 1 if and only if there is a g(x) = Y. CH* 1 sucn 
that Tf = f * g. Equivalently , T commutes with xm ax if and only if! is a multiplier 
transformation. 

Proof. Simply consider the action of T on x n . Assume that T(x n ) = Yk=o biX 1 . 
Commutativity implies that 

TL TL 

a n T(x n ) = a n ^b i x i =T(x n )(ax) =^b i a i x i 

k=0 k=0 

This implies that all bi are zero except for b n , so T is a multiplier transforma- 
tion. □ 

Lemma 6.44. Suppose that the linear transformation T preserves degree. Then T 
commutes with A : x x + 1 if and only if there is a g(x) = Y. Q i xl sucn that 



T(x n ) = ^ Q n-i ( n ) x 1 - Equivalently, 
i=o ^ 



T(f) = (g*Af) Tev . 

Proof. That T(x n ) has that form can be proved by induction. Conversely, note 
that the conclusion is linear in g, so we may take g(x) = x s . It is easy to see 
that the diagram commutes. 

— — L£=o(?K-ix l 



XI->X+1 

Y 

- TL /TL 



££* (?)x> ' - U,- =0 ( • )^(> k 



□ 
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Remark 6.45. If we ask that T commutes with the seemingly more general 
transformation x i— > x + y, we find that T is the same as is described in 
Lemma [6.43l For example, the Bernoulli polynomials satisfy this commutativ- 
ity property. However, this is not useful for us as most Bernoulli polynomials 
do not have all real roots. 

If we choose g(x) = e~ x then 

so we have a commuting square in two variables 



x^x+y 



x^L^ v (x) 



xi-^x+y 



(6.8.1) 



Example 6.46. IfT(x k ) = %^andAf(x) = f (1 - x) then AT = TA. 

Example 6.47. If T(x k ) = ^ = (*) and Af(x) =f(-x-l) then AT = TA. 
Example 6.48. T : x n i— > H n satisfies 

T(f)(x + y)=T,f(x + 2y) (6.8.2) 
which leads to the diagram 



xi-^x+2y 



Xl-tX+l) 



Example 6.49. T: x k i— > Hi<(x)x n k satisfies 
T(x-2 + ar = H n (^) = (- 
If we define S(f) = T(f(x - 2)) then 



-l) n T(x-2-a) 



xi— »ot+x 



xi — »a-x 



(6.8.3) 



Example 6.50. T: x k 

of degree n 



x) v ( x — oc ) n _ v satisfies the identity on polynomials 

(6.8.4) 



(Tf)(a-x) = (-l) n T(f Tev 

T 



reverse 



(-D n T 
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Example 6.51. T: x 1 i— ► ( x + n — i )^ satisfies 

Tf = (Tf rev )(-l-x 

since (— x — 1 + iL = (- 



(6.8.5) 



reverse 



l) n ( x + n — i )^ . Equivalently, 

T 

>- • 

xi— >— 1— x 



Example 6.52. The Laguerre transformation T: x n L n (x) satisfies several 
identities. This square is the Laguerre identity d7.10.4t . 



x 

xi— >x+y — 1 



(x + y-1) 



T also satisfies 



U(x) 

Homogenize 

■y n U(x/y) 



(6.8.6) 



We can combine the Laguerre and Hermite polynomials. Define the linear 
transformation T acting on P(n), and two induced transformations: 

T(z k ) =H k (z)L n _ k (z) 
T x (xV) = H r (x)L n _ r (x)u s 
T y (x r y s ) =x T H s (u)L n _ s (u) 

We have the following commutative diagram 

(x + u 

T- 




T x (x + u) 
which if written out is 



xi->2x/yi-ry/2 



k=0 



Y_ ( J H k (x)L n _ k (x)u n - k = Y_ ( J H n _ k (y/2)L k (y/2) (2x) k 



k=0 



Example 6.53. The falling factorials (x) k and L n /n! are polynomials of bino- 
mial type, and satisfy 
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L n {x + y) 



k=0 



k=0 



n\ Lic(x) L n _ k (y) 



Jc/ k! (n-k)! 
A polynomial family {p n } of binomial type satisfies 

p n (x + y) = Y_ ^Pk(x)p n -k(y) 



For a general family of polynomials of binomial type, the transformation 
T: x n p n (x) satisfies 

T 



x'- 

xi-^x+ij 

¥ 

(x + y) n — 

where the transformations T x , T v are the induced transformations. Consid- 
ering the inverse transformations yields 

T- 



-^Pn(x) 

xi-^x+y 

¥ 

Pn(x + y) 



xi-^x+ij 

V 

(x + y) n 



-Pn(x) 

xi-^x+y 

¥ 

Pn(x + y) 



6.9 Singular points of transformations 

If T is a transformation, and f e P is a polynomial for which the degree of Tf 
is different from the degree of Tg where g is arbitrarily close to f, then we say 
that f is a singular value for T. We are interested in transformations for which 
(x + a) n is a singular value for infinitely many n. It is often the case that the 
behavior of such a transformation changes at a, and that there is some sort of 
symmetry around x = a. These symmetries are captured in the commuting 
diagrams of the previous section. If T is a linear transformation that preserves 
degree, then there can be no singularities, but there can be symmetries. We 
know of the following examples: 

1. T: x k ~ <x) k (x_a) n _ k T(x - 1)- = (<x) n 



2. T: x 1 ^ ( x + n-i L T(x + l) n = (-l) 
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3. x k ^ HvfxlH^vfx 



T(x-l) n = 2 n/2 H n (0) 



4. x k ^ H k (x)H n _ k (x) 



T(x-i) n =2 n (l + i) 



5. x k i > L k (x)L n _ k (x) 



6. T: x k i ► H k (x)x n - k 



T(x-l) n = if nis odd 
T(x-2)^ = H n (0) 



JklXJn-k 



T(x-l) n = if nis odd 



6.10 Appell sequences 

Some of the properties of special functions and commuting diagrams are easy 
for the general class of Appell sequences. A sequence of polynomials {p n } is 
anAppell sequence [148J if the degree of p n is n, and 



(6.10.1) 



Such sequences are in 1 — 1 correspondence with formal power series g(x) 
[146 1 and satisfy 

g(D)x n =p n (x) for n = 0,1,... (6.10.2) 

Proposition 6.54. Suppose {p n } is an Appell sequence determined by g(x). Then, 
g(x) e P iff the linear transformation x n p n maps P to itself. 



Proof. The linear transformation is simply f i— > g(D)f. 



□ 



If a sequence of polynomials satisfies p n = a n np n _i, then the polynomials 
= Pn/(^oai • ■ • oc n ) form an Appell sequence since 



In = Pn/(«o ■ • • On) = annpn-i/foco ■••«„.)= npn-j/too • • • a n _i) = nq n _i. 

The Hermite polynomials are nearly Appell sequences. Since they satisfy 
= 2nH n _i, H n (x/2) is an Appell sequence. For a more interesting exam- 
ple, consider the transformation T : f i— > f (x + i) +f (x — i). Now Tf = 2cos(D)f, 
and so the sequences 

are Appell sequences with g(x) = cosx. 
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The Euler polynomials form an Appell sequence with g(x) — |(e x + 1). 
This function is not in P, so we can not conclude that the linear transforma- 
tion determined by the Euler polynomials maps P to itself. Indeed, it doesn't. 
Similarly the Bernoulli polynomials have g(x) = (e x — l)/x, and the corre- 
sponding linear transformation doesn't map P to itself. 

Any Appell sequence satisfies the identity 

Vn(x + y) =j[_ (£)pk(x)y n - k (6-10.3) 

k=0 ^ ' 

which we have already seen for the Hermite polynomials in ( |6.8.2> . Equation 
(|6-10.3b implies that we have the commutative diagram 

X n s-PnlXj 



xmx+tj 



XMx+y 



[x + v) n -Pn(x + y) 

If T: x k i— > pi c (x)x n_k then the lemma below shows that T*(x — y) r has 
degree n for all y other than y = 1, so 1 is a singular point. 

Lemma 6.55. If {p n } z's Appell andT: x k i— > p lc (x)x n_k , then 
T,(x-l+y) T =x n - T p r (xy) 

Proof. Since the conclusion is a polynomial identity it suffices to prove it for y 
an integer. It's trivially true for y = 1, and the case y = follows from 



T(x - l) r = Y_ ( T \ (-iy- k Vk (x)x r - k = x n - r Vn (x -x) = x— r p n (0). 

k=0 ^ ' 

Assume that it is true for y, and consider y + 1 

T,(x-l+y + l) r =^ Q T ,(x-l+y) k = ^ (?V~ k p k (xy) = 

k=0 ^ ' k=0 ^ ' 

x n - r f_ Qx r - k p k (xy) =x— r p T (xy +x) =x n - T p r (x(y + 1)). 

k=0 ^ ' 

□ 

6.11 The diamond product 



If we are given an invertible linear transformation T on polynomials then we 
can form a new bilinear transformation. 
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P x P — ^P x P 



fOg 

Y 

P - 



multiplication 



T- 1 

We call f g = T _1 (Tf • Tg) a diamond product. The product can be de- 
fined for any number of terms in a simple way. For instance, observe that 

fO(gOH)=fO[T- 1 (Tg-Th)] =T" 1 (Tf • T[T- a (Tg • Th)]) = T" a (Tf ■ Tg ■ Th) 

Consequently, we see that the diamond product is well defined for any 
number of factors, is associative and commutative, and equals 

f 1 0f 2 0---0fn=T- 1 (Tf 1 .Tf 2 ...Tf n ) 

We can now prove a general result: 
Proposition 6.56. Suppose that T is a linear transformation such that 

1. T: P 1 — >P J 

1. T _1 : P J (l) — ► P K (1) 
Then every f e P 1 can be written the form 

c(x - ai) (x - a 2 ) • • • (x - a n ) 

where c is a constant and each x — cu is in P K . 

Proof. From (1) we see that T(f ) e P J , and consequently can be factored T(f ) = 
Co(x — bi) • • • (x — b n ). Write T _1 (x — bt) = Ct(x — at) where each x — a t is in 
P K by (2). If c = c • • • c n then 

c(x-ai)® (x-a 2 )® ••• (g)(x-a n ) = c T _1 (ciT(x - aj) • • • c n T(x - a n )) 

= T- 1 (c (x-b 1 )---(x-b n )) 
= T- a (Tf)=f 

□ 

Here are some examples of diamond products, 
binomial Let T _1 (x n ) = Then 



x\ A /X 

r 



T- A (x 
r + s 
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This was considered by Wagner 11781 who introduced the name dia- 
mond product. See Propositi on II l.lOOl 

falling factorial Let T _1 (x) n = x n . Then 



See Question l43l 

We can generalize the diamond product construction by replacing multi- 
plication by some other bilinear map. Given m : P x P — > P then we form a 
new bilinear transformation 



See Proposition lll.lOll for an application. 

Although affine transformations distribute over multiplication, they do 
not generally distribute over diamond multiplication. We need a connection 
between the affine transformation and the defining transformation. 

Lemma 6.57. Suppose f g = T~ : (Tf • Tg) and A is an affine transformation. If A 
commutes with T then A distributes over ■ Conversely, if A distributes over and 
A commutes with T for polynomials of degree 1, then A commutes with T. 

Proof. The first part is just definition: 



m 



n+m 



This product is discussed in § 1818.81 



exponential If T(x n ) = x n /n! then 




fOg=T- 1 (m(T(f),T(g))) 



A(fOg) = AT (Tf • Tg) = T~ : A(Tf -Tg) = T (ATf • ATg) 



T (TAf • TAg) = Af OAg. 



Conversely using induction 



AT(fg) = A(Tf Tg) = ATf ATg = TAf TAg [induction] 



T(Af • Ag) = TA(fg) 



□ 
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6.12 Other root preserving transformations 

There are linear transformations that increase the degree of a polynomial by 
more than 1 and still preserve roots. Such transformations W will not preserve 
interlacing if the degrees of Wf and Wg differ by more than 1 . 

A trivial example is Tf = p ■ f where p S P. This transformation does 
preserve interlacing. A more interesting example that does not preserve inter- 
lacing is 

T:x n ^(x 2 -l) n 

so that deg(Tf) = 2deg(f). This transformation can also be expressed as 
Tf = f(x 2 -1). If f € P'- 1 - 00 ' and roots (f) = (ai,...,a n ) then roots (Tf) = 
(. . . , ±(a.i + l) 1 ^ 2 , . . . ). Thus, T preserves roots for polynomials in p' -1 - 00 '. 
We can generalize this idea. 

Lemma 6.58. Suppose that g e P(r) has the property that no relative maximum 
or minimum of g has value lying in (—1,1). If f e P (_1,1 '(n) then f(g(x)) e 
P l - U) (nr). 

Proof. Since g has no extrema with value in (—1, 1) it follows that every line 
y — s where — 1 ^ s ^ 1 meets the graph of g in r distinct points. 

Without loss of generality we may assume that f has all distinct roots. If 
f (t) = then — 1 < t < 1 and we get r roots of f (g(x)) from the r solutions to 
the equation g(x) = t. This accounts for all nr of the roots of f(g(x)). 

□ 



Corollary 6.59. iff e P' 1,1 ' (n) and T r is the Chebyshev polynomial, then f (T T (x) ) 
is in pt-Wl (nr). 

Proof. The graph of the Chebyshev polynomial T r (see Figure 16.2b on the in- 
terval (—1, 1) oscillates between its r — 1 relative extrema of 1 and —1. Now 
apply the lemma. □ 

For example, since T2 = 2x 2 — 1, the transformation f (x) 1— * f (2x 2 — 1) maps 
to itself. 
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Linear transformations that 
preserve roots 



We apply the results of the previous chapters to show that particular linear 
transformations preserve roots. 

7.1 Multiplier transformations 

The simplest linear transformations are multiplier transformations, which are of 
the form x 1 1— > aix\ We will see in Theorem l 1 5 .221 that if the series Y. Q i if is m 
ppos men me multiplier transformation maps P to itself. In § |5l5.3| we showed 
that various series were in p alt or p pos . Consequently we have 
Theorem 7.1. The following linear transformations map P toP and pp° s toP v ° s . 
x l 

1. x^\ 

v. 

x i 

2. x l i ► — for a ^ 

3. x 1 1— > -r—r- for positive a. 

x i 

4. x 1 1— > - — —for positive integer a. 

(oaj! 

5. x i i * q^x 1 / *" Iql ^ 1/4. 

x l 

6. x 1 1-^ —for\q\ > 4. 



170 
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7. x 1 1— > (d) i x 1 -/or d a positive integer. 



8. x l i— > ( d+ J 1 )x 1 /or d a positive integer. 



i!qW • 
9. x x h-> — x x for Iql < 1. 

(q;q)i 



20. x x i * " x T /or |q| < 1. 



Proof. The first two follow from <5.3.3t , the third one from 115.3. 41 . and the 
fourth from 111381 62,#162]. The generating function of (d) i is in P pos : 

1=0 ' i=0 V 7 

The generating function for x n /[n]! is in P - see 15.41 The generating function 
for C^- 1 ) is 

i=0 v 7 ' 

where L^-i is the Laguerre polynomial (see Section !/ 7 . lot . Since e x e pi 303 and 

Ld-i (~x) G P pos their product is in pP°l 

The last two use J5.1.1I I and part (1). □ 

We can extend (7) of the last result to non-integer values of n, but surpris- 
ingly there are restrictions. 

Lemma 7.2. If n is a positive integer and oc > n — 2 ^ then 

1. the map x l h-» (a^x 1 maps P pos (n) to z'tseZf, and P aU (n) to z'fseZf. 

2. the map x 1 k+ (^x 1 maps P v ° s [n] to itself, and P alt (n) to z'fse//. 

Proof. We find a recurrence and then proceed by induction. First, let T(x n ) — 
(a) n x n . Then 

T(x-x l ) = (a)Ja-i)x i+1 

= ax (a) i x x — xi (a) i x x 
^<xxT(x i )-xT(x{x i Y) 
T(xf) = axT(f) -xT(xf'). 
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We now show by induction on m that T maps P pos (m) to P pos (m) for 
m < n and to P(m) for m = n. For m = 1 we take f = x + r where r > and 
then Tf = r + ax is also in P pos (1). 

Assume that T maps P pos (ra) to itself and m < n. If we choose g = (x + r)f 
where f has positive leading coefficient and r > then 



Since f e P pos we know that xf < f and hence by induction T(xf ) < T(f ). It 
follows that T(g) = («x + r)T(f) -xT(xf') <T(f). If ra = n - 1 this shows that 
Tmaps P pos (n) toP(n). 

Now assume that m < n — 1. In order to show that T(g) is in P pos we need 
to show that (Tg)(0) and the leading coefficient of T(g) have the same sign. 
First, (Tg)(0) = r (Tf)(0) is positive since r > and T(f) has positive lead- 
ing coefficient and is in P pos . Next, the leading coefficient of T(g) is s(a) m+1 
where s > is the leading coefficient of f . Consequently, the leading coeffi- 
cient of T(g) is positive since a > n — 2 ^ ra and the conclusion now follows. 

The second part follows from the first by applying the exponential trans- 
formation (Theorem I7.1IT )). The results for P alt follow from applying the 
above to the composition f(x) i— > T(f (— x))(— x). □ 

If the generating function lies in P but not in P alt or p pos then the corre- 
sponding linear transformation maps P to P by Theorem ll5.22l 

Lemma 7.3. The following linear transformations map P ± to P. 



The q -exponential function was defined in Example 15.31 We define the 
q -exponential transformation (0 < q < 1) 



T(g) = axT(f) - xT(xf') + rT(f). 






Proof. The correspond the the functions e x , sin(x) and cos(x). 



□ 



Expptx 1 -) : x 1 h-> — 



(7.1.1) 



Theorem l7.1l shows that Fxp q maps P to itself. 
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7.2 Homogeneous transformations 

Some linear transformations are only defined on polynomials of bounded de- 
gree. For instance, if 

T(x l ) = x i {l-x) n - i 
then T is a linear transformation on all polynomials of degree at most n. 
Lemma 7.4. If a and c are not both zero then the linear transformation T given by 
TV) = (ax + b) l (cx+d) n-i 

maps P[n) to itself. 

Proof. Let F be the homogeneous polynomial corresponding to f. If f (x) = 
(x— m) • • • (x— a n ) then F(x,y) = (x— a\y) . . . (x— a n y). The image of f under 
the transformation T is simply F(ax + b, cx + d). In order to see this, we only 
need to check it for a basis, namely x x , in which case the result is clear. 
We therefore have the factorization 

T(f) = F(ax + b, cx + d) = (ax + b — ai(cx + d)) ■ ■ ■ (ax + b — a n (cx + d)) 

which shows that Tf has all real roots. □ 

This lemma also follows from the properties of Mobius transformations in 
§ 1616.71 - the argument here shows the factorization of the transformation. 

Corollary 7.5. The linear transformation T: x 1 i— > x x (l — x) n_1 maps P alt (n) to 

Proof. This is a special case of Lemma [6.371 where M = z/(l — z) since M maps 
(0, 1) to (0, oo. Alternatively, if a is a root of f G P alt (n) then the correspond- 
ing root of Tf is a/(a+ 1). □ 

The reversal operator is the special case of this construction where we set 
a = 0, b = 1, c = 1, d = 0. See gXl . 

Lemma 7.6. If the polynomial f (x) has all real roots then f REV has all real roots. If 
f e P pos (resp. P alt ) then f REV G P pos (resp. P alt ). 

If f <g and is a not root of fg then g REV <f REV . If f < g and f e P ± then 

j REV <j g REV 

Proof. Without loss of generality we assume that is not a root. The reverse of 
f is x Tt f(i), so its roots are the reciprocals of the non-zero roots of f. The rest 
of the properties are immediate from this observation. □ 

If f has as a root, then the degree of f REV is less than n. If is not a root 
of f, then ( f REV ) REV = f. However, if is a d-fold root, then x d ( f REV ) REV = f 

If we have a linear transformation T on P(n) then we can conjugate T by 
the reversal map to create a new linear transformation T rev on P(n). 
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Pfn) • P(n) 



P(n) ^Pfn] 



Here are some examples: 

1. If D is the derivative, then D is the polar derivative. 

2. If T is the transformation x 1 i— ► CiX 1 then T REV (xt) = c n _-j,x\ 

3. IfT(x x ) = pi where Pi is symmetric (pi = (pi) REV ) thenT REV (x l ) —Vn-x- 
This is the case for the Hermite polynomials (see § 1717.811 . 

7.3 The Hadamard product 

The generalized Hadamard product of two polynomials is the coordinate wise 
product with an extra coefficient per coordinate: 

(d + ajx + a 2 x 2 H h a n x n ) © (b + b a x + b 2 x 2 H h b n x n ) 

= Q b Co + aibicix + a 2 b 2 C2X 2 H h a n b n c n x n 

Equivalently, we can define it in terms of monomials: 

z = I Ci x 1 i = i and Ci > 

x © x I ► \ 

With the help of the method of generating functions (p. |500l l, we will prove 
that 

The generalized Hadamard product © determines a map P pos x 

oo i 

P h+ Pif and only if V a— e P^ 5 . (7 - 31) 

If all Ci are 1 we call it the Hadamard product and write f * g in place of 
f © g. If Ci = i! then we write f * ' g. It follows from the above result that * and 
* ' map P pos xPtoP. There is a an alternative proof for the Hadamard product 
(see 5 1919.13b that uses an identification of it as a coefficient of a polynomial in 
two variables. 

The generalized Hadamard preserves interlacing. 

Lemma 7.7. If® satisfies (|7.3.1L f < g e pP° s , he P then f © h< g © h. 

Proof. If g = od + r where f < t then f © h^af © hand f © h<r © h since all 
coefficients are positive. Adding these interfacings gives the result. 

□ 
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Our goal is to prove a multiplicative property of ©. We begin with a special 
case. 

Lemma 7.8. Iff e P v ° s , h e P, a, (3 > and © satisfies $73A$ then 

(x + a) f © (x + (3) h< f © h. 

Proof. We have the interfacings 

xf©xh<f©h since xf © xh = x(f © h) 

xf©|3h<f©h LemmalTTl 
af©xh<f©h LemmalTTl 
af © |3h<f © h 

and the conclusion follows upon adding the inter lacings. □ 

It's surprising that the generalized Hadamard product allows us to multi- 
ply interfacings. In the case of * there is a natural interpretation using polyno- 
mials with complex coefficients (p. |606t . 

Lemma 7.9. If f < g in P,h<kinP v ° s and © satisfies j73.ll then 

f © h< g © k 
Proof. Using Lemma [1.201 we write 

£j \ ■ ~i i -\ . -A I \ 



X — Si 



where the r's and s's are negative, and the a's and b's are positive. From the 
above lemma we know that for all relevant i, ) 

Qi f bj h 



f © h< 

X — Ti X — Sj 

Adding these interlacings gives the conclusion. □ 
Lemma 7.10. If mis a positive integer then 

1. the map x 1 1— ► ( 1 { l )x 1 - maps P to itself. 

2. the map x x i— » [rrCj^x 1 maps P to itself. 

Proof. The first statement is the map f h (1 + x) m * f. The second statement 
follows from the first statement and Theorem |9.87| Another proof uses gener- 
ating functions -see Theorem l7.ll □ 

It is obvious that the Hadamard product extends to a bilinear map P x 
— > P. Since 

e* 2 * (x + 1) 2 =x 2 + l 

it follows that e" 2 ^ P. Of course, we know that e~ x2 G P. 

The effect of scaling on the Hadamard product is easy to describe. 
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Lemma 7.11. Suppose that © is a generalized Hadamard product, and choose con- 
stants a, |3. If S, T, U are regions satisfying 

©: P s xP T — > P u 

then 

©: P aS x P pT — > P a|3U 

Proof. It follows from the definition of generalized Hadamard product that 

f (ax) © g((3x) = (f © g)(a(3x) 
Consideration of the diagram below yields the conclusion. 



p«S x p|3T ^ p«(3U 



xx yi-^axx |3ij 

I 



xmx/( a(3 ) 



P S xP T ^P U 



□ 



Remark 7.12. Differentiating a Hadamard product has no nice properties in 
general. However, we do have the following properties that are easily veri- 
fied, where m is non-negative. 

^ [(1 + x) m © xf] = m(l + x) m - x © f 



^■[e x ©xf] = e x ©f 



J [e x © x n 



f = e x © f 



Thus, if f e P then we can find polynomials po,Pi, ... so that 

1. po = e" © f. 

2. all pi are in P. 

3. (d/dx)pt = Pi_j for i = 1,2, 

Such polynomials have been called very hyperbolic. Here is a different con- 
struction of very hyperbolic polynomials due to Chebeterov. 

Lemma 7.13. Ifi — YJ) Q i xl <= P an d f n = Y.o a i (n-i)! ^ n ^ az,e a " refl ^ 

roots and f ' = f n -i- 
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Proof. Clearly f' n — f n _i; we need to see that f n e P. Now 







''=1 I "^'-i'ra 



is in P, and taking the reverse shows f n S P. □ 
For more properties of the Hadamard product, see § 1919.131 

7.4 Differential operators 

The most general differential operators on polynomials that we consider have 
the form 

n 

g^^fi(x)gW(x) (7.4.1) 

i=0 

where the fi(x) are polynomials. We first determine some restrictions on the 
coefficient functions for operators that map P to P. We then look at some 
general composition properties of polynomials, and apply these to some par- 
ticular operators. In Proposition ll5.66l we will characterize, modulo some pos- 
itivity and degree conditions, those differential operators l|7.4.1)l that map P to 
P. 

It is useful to view the differential operator as being determined by the 
two variable polynomial f(x,u) = ^fi(x)u l . These polynomials satisfy a 
substitution condition: 

Lemma 7.14. If f (x,y) = fi(x)y l determines a differential operator ([7.4.11 that 
maps P to itself then f (x, a) € P for all ael 

Proof. If the corresponding differential operator T maps P to P then T maps P 
to P . If we apply T to e ax € P we find 

Since the right hand side is in P we can multiply by e~ ax and remain in P. 
The result follows. □ 

We next consider two particular differential operators. Choose a polyno- 
mial h(x) = Y. a i xX an d define h(D)g = Y. a i9' 1 '- m this case the function 
fi(x) of l|7.4.1|l is just the constant at. We also define 

H(xD)g = ^a i (x — ) g(x) 



1=0 



If we expand (xD) v g we see we get a differential operator of the form ( |7.4.H . 
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Proposition 7.15. The map fx g i— ► f (D)g defines a bilinear map 
P xP — ► P. 

Proof. It follows from Corollary [7T231 that (D - ct)f G P for all f. Apply Corol- 
lary E3l □ 

Example 7.16. If we choose f = x 2 — 1 then for any g e P it follows that 
g — g" € P. Moreover, if g < h then g — g" < h — h". 

If n is a positive integer then the linear transformation f i— ► YJi=o WP ' T ' 
maps P to itself - simply choose f = (1 + x) n . 

Suppose f = x+1 and g = x+2. Although f -C g, substituting the derivative 
reverses the direction, e.g. x + 1 = f(D)x^>g(D)x = 2x + 1. This is not true in 
general, but depends on the sign of the constant terms. 

Proposition 7.17. Iff <§; g and he P and if 

f(0)g(0)>0 then ff(D)h»g(D)h 
f(0)g(0)<0 m \f(D)h<g(D)h 

Proof. Linearity implies that f(D)h and g(D)h interlace, but we don't know 
which direction. If we write f(D)h=|3g(D)h. + k then we need to determine 
the sign of the leading coefficient of k, and the signs of the leading coefficients 
of f(D)hand g(D)h. Compute 



K(x) = 


x n + a n ^x n - 1 + ■■■ 


f(x) = 


x m + • • • + bjx + b 


g(x) = 


x m + ■ • • + CiX + c 


f(D)H = 


b x n + (b a n _i +nbi)x n ~ 1 + 


g(D)h = 


c x n + (c a n -i + nci)x n_1 + ■ 


f(D)H = 


b « mil, bjCo-boCi n _ 1 
— g(D)h+n x 



+ ••• 



Without loss of generality we assume that f -c g • Consequently, we know the 
numerator biCo — boCi is positive. 

If boCo > then assume that b > and Co > 0. The leading coefficients of 
f (D)h and g(D)h are positive, and the leading coefficient of k is also positive. 
This implies g(D)h< f (D)h. 

If boCo < then assume that bo > and Co < 0. The leading coefficient of 
f (D)h is positive, the leading coefficient of g(D)h is negative, and the leading 
coefficient of k is negative. This implies q(D)h^> f (D)fi. 

The remaining cases are similar. □ 

Example 7.18. We know that f < T(g) for all g e P implies that T is the deriva- 
tive. However, we can find many different linear transformations S, T so that 
S(g) < T(g) for all g e P. Choose fi < f2 where f 2 (0) = and define 



S(g)=fi(D)g T(g)=f 2 (D)g. 
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The proposition above implies that S(g) and T(g) interlace, and since f2(0) =0 
the degree of T ( g ) is one less than the degree of S ( g ) . Thus, S ( g ) < T ( g ) . 

Corollary 7.19. The transformation 1 fxgn f [xD)g defines a bilinear map 

1. P pos x P ± — > P ± . 

2. P l {n) xP — >Pifl = (-oo,0) U (n,oo). 

3 ppos x p(0,l) > p(0,l) 

4 ppos x p(— 1,0) y p(-l,0) 

Proof. To prove the first part for f e j> pos it suffices to show that f(xD)g 6 P 
when f = x + a and a is positive. Since f (xD)g = ag + xg' this follows from 
Corollary l2.16l The second part also follows from the corollary. The third part 
holds since ag + xg' S p' 0,1 ' when a is positive and g € p' 0,1 '. The fourth part 
is similar. □ 

We can extend this to analytic functions: 

Corollary 7.20. Iff € P has no roots in [0,n] then T(g) = f(xD)g defines a linear 
transformation T: P[n) — ► P[n). 

Proof. The reason this is not immediate is that f is not necessarily presented 
as a limit of polynomials with no roots in [0, n] . However, if a sequence f i 
of polynomials converges uniformly to f, then the zeros of the f\ converge 
uniformly to the zeros of f . If f has no roots in [0, n], then the roots of ft must 
go to to roots that are not in [0, n] as i goes to infinity. If g G P(n) then for i 
sufficiently large no roots of fi are in [0, n], and we can apply Corollary 17.271 
as above. □ 

It's easy to express f (xD)g in terms of coefficients. If g(x) = bo + - • ■ +b n x n 
then 

f (xD)g = f (OJbo + f (l)bix + • ■ • f (u)b n x n (7.4.2) 

To see this, note that the conclusion is linear in both f and g so it suffices 
to establish it for the case f = x r and g = x n . In this case we see that 

f (xD)g = (xD) r x n = n r x n = f(n)g. 

We can interpret l|7.4.2ll as a Hadamard product, but the factor correspond- 
ing to f is not in P. Given a polynomial f, we form the infinite series 

F(x) = f(0)+f(l)x + f(2)x 2 + --- 

We see from J7.4.2|l that f(xD)g = F * g. Moreover, we will see in § 1717.131 that 
this series is a rational function of the form h(x)/ ( 1 — x) r , and so F is not in P . It 



1 It is not true that the transformation maps P aU x P^ 1 to P. For example, if f = (x — .7) (x • 
and g = (x + .7) (x + .1) then f (xD) g = .0049 + .216x + 2.47x 2 has no real roots. 
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is interesting that there are series that map P to P under the Hadamard prod- 
uct, but these series are not uniform limits of polynomials. This phenomenon 
is explained in Theorem l9.87l 

Using Corollary l7.19l and l|7.4.2Jl we get these corollaries. 

Corollary 7.21. If g — bo + bjx + • • • b n x n is in P and r is a positive integer then 
the following polynomials are in P. 

b + Tbjx + • • ■ + u r b n x n 

(r) r b + (t + l) r bix + • • • + ( n + r )„b n x n 

Proof. Choose f = x T in the first case, and f = (x + 1) • • • (x + r) in the second. 

□ 

We end this section with a discussion of the Bernstein polynomials. Given 
a function, the n-th Bernstein polynomial is defined by 



k=0 



n-k 



and in particular the polynomial corresponding to x T is 



k=0 v 



n-k 



Note that 



k=0 v 7 v ' 

We therefore have the diagram 

f(x)— ^f(Sp)(l +x )n 

"B n (x) 

Now from Corollary 17. 191 we know that T: f i— ► f(^p)(l + x) n determines 
a map P pos — > p'- 1 - 01 . In addition, since f G P 11 ' 00 ' implies that the roots of 
f (x/n) are greater than n, we find that T also maps P (1 '°°' to p'- 00 -- 1 '. More- 
over, it follows from the Corollary that we actually have p 15 ^ - 1 ' — > p. Com- 
bined with the Mobius transformation z z/(1 — z), we get the commuting 
diagram of spaces 

pR\(0,l) 1 s-p(-W) 

fi-Bn(f] 

In summary, 
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Lemma 7.22. The transformation f i— > B n (f) satisfies 

1 pR\(0,l) > p 

2 p[—°°,0) ¥ p(— oo,0) 

3 p(l,— oo ) y p(l,oo) 

7.5 Transformations based on differentiation 

We now look at specific transformations that are based on differentiation. The 
proofs are all easy consequences of the results in Chapter[T]and Chapter[6] 

Corollary 7.23. I/Tf = (a+ D)f then T: P — > P. If oc > f/ien T: P pos — > 
P pos . I/a< OthenT: P alt — > P alt . 

Corollary 7.24. If ot, y > 0, 11 = (ax — {5 — yxD)f, f/zen 

T : P — ► P. If f G P Qlt tften h < f . If we fazoic in addition that (3 > t/zen 

J. pa^t > palt 

Corollary 7.25. If a and e are positive then the transformation 
fx g i—* f (ax — eD)g defines a bilinear map P x P — > P. 

Proof. Apply Corollary EH □ 
Corollary 7.26. Themapfxg t-^ f(x+xD)g defines abilinear map P pos xP alt — > 

palt 

Proof. Apply Corollary E21 □ 

See Corollary 12 . 1 61 for the details about the next transformation. 

Corollary 7.27. Ifa£ (0,n) and Tf = (-a + xD)f then 
T: P ± (n) — * P ± (n) aniT: P(n) — ► P(n). 

Corollary 7.28. 7/ Tf = ( |3 + ( 1 - x 2 ) D) f where (3 > and f e P ( " u 1 f/zen Tf < f 
aniT: P ( - u) — .pl- 1 - 00 '. 

Proof. Note that 1 — x 2 is positive on the roots of f . Let 

h— (f3 + (1 — x 2 )D)f. Since the leading coefficient of h is negative it follows 
from Lemma [L20l that h<f. At x = 1 we see that h(l) = fjf(l) , so h has 
at exactly one root greater than 1. There are no roots less that -1, so h £ 

p(— l,oo) |— I 

Corollary 7.29. If Tf = axf + x(l - x)f then T maps p(-°°'°] fo ftse/f. 

Proof. Apply Lemma [L20l after factoring out x. □ 

A similar argument shows that 

Corollary 7.30. If Tf = ((2x+ a) - (1 -x 2 )D)f where |a| < 1 then! maps P { - u] 
to itself. 
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7.6 The falling factorial 

Recall (x) n — x(x — 1) • • • (x — u + 1). Since (x) n+1 = (x — n)(x) n the falling 
factorials are of Meixner type with parameters (a, a, b, (3) = (0,1,0,0). If 
T : x n i— > (x) n then it follows from 1 16.5. 6\ and 1 16.5. 7ft that 

T(xf ) = xT(f ) - T(xf ) (7.6.1) 
T- X (xf] =x'r 1 (f) +xT- : (f)' 

The polynomials T _1 (x n ) are also known as the exponential polynomials 11481 
page 69] or the Bell polynomials 

Proposition 7.31. 7/T(x n ) = (x) n = x(x- 1) . . . (x-n + 1), then T maps P alt to 
itself. T- 1 maps P pos to itself. 

Proof. From | |7.6. 11 , we can write Tfxf ) = xT(f)— TB(f) where B(f) =xf. Since 
Bf <f for any f with all positive roots, Theorem 16.361 shows that T preserves 
interlacing for polynomials with all positive roots. 

We can also write T~ : (xf ) = xT~ : (f ) + BT _1 f, so that the second statement 
now follows from Theorem [636] □ 

Remark 7.32. We can use the falling factorial to explicate a subtlety of Theo- 
rem [63] If T is a transformation that preserves interlacing, then we can define 
S by T(xf) = S(Tf), and Sg <J g for g = Tf. From Theorem 16.31 we see that if 
Sg <. g then S has a certain form, namely Sg = (ax + b)g + (cx 2 + dx + e)g'. 
This would seem to contradict the fact that T is arbitrary. 

In the case of the falling factorial, where T(x n ) = (x) n , we can show that 
S(x n ) = x(x— l) n . In particular, S(x T1 ) is not interlaced by x n . Moreover, S does 
not have the simple form given above. The reason is that we have ignored the 
domains of definition of these results. The explicit form of Sg was determined 
by substituting x n for g. However, we only know that Sg <J g when g = Tf, 
and f e P alt , yet T^x* 1 is a polynomial in P pos . 

To establish that S(x n ) = x(x- l) n , define L(f(x)) = f (x - 1). The map S is 
defined by S((x) n ) = (x) n+1 , and this can be rewritten as S((x) n ) =xL((x) u )). 
By linearity, Sf = xLf, and substituting x n = f shows that S(x n ) = x(x — l) n . 

The falling factorial transformation is connected to a series identity. 
Lemma 7.33. IfT(x n ) = (x) n , then for any polynomial f we have 

Z^xWT^f). 

i=0 

Proof. By linearity we may assume that f (x) = x n . If we let h n (x) = Y. Tr* 1 / 
then it is easy to verify that xh^(x) = H n+ i. Since induction and integration 
by parts show that there is a polynomial g n such that h n = g n e x , we find that 
g^ satisfies g n+1 = x(g n + g^J. 
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We can now show by induction that g n = T _1 (x n ). It is clearly true for 
n = 0, and the recursion (|7,6.1ll shows that T _1 (x n+1 ) = x(g n + g^J = g n +i- 

This is an ad hoc argument. See 11481 page 70] for a proof based on the 
general principles of the operator calculus. □ 

From i6.5.7i we can express T _1 as a composition: 

T- 1 (f)=f(x + xD)(l). (7.6.2) 

We could have applied Corollary l7.26l to deduce that T _1 preserves interlacing. 

Corollary 7.34. If a polynomial f has all negative roots, then for all positive x 

f(x) 2 ^ f(x-l)f(x+l). 

Proof. First, assume that x is a positive integer. Using Lemma 17.331 we see if 
T(x n ) = (x) n and f € P pos then T^f € P pos , and so e x T- 1 (f) is in P^ 1 . The 
inequality is Newton's inequality (Theorem l4.81 for P. 

In general, let a = r + e where r is an integer and ^ e < 1. If g(x) = f (x + 
e ) then the desired inequality follows by applying the previous paragraph to 
g(x) since g(r) = f(a). □ 

7.7 The rising factorial 

Recall (x) n = x(x + 1) • ■ • (x + n - 1). If T: x n h-> (x) n then 

T(xf) =xT(f) + T(xf] (7.7.1) 
T -1 (xf) =xT- 1 (f) -xT- : (f)' 

Corollary 7.35. If T(x n ) = x(x + 1) . .. (x + n - 1), then T" 1 maps P alt to itself. 
TmapsP v ° s to itself. 

Proof. The proofs of the first two assertions are like the proofs for the falling 
factorial, and uses the recurrences of l|7.7.1b . 

□ 

The rising factorial satisfies a different recurrence than the falling factorial, 
and it leads to a composition formula. Let 2 Af (x) = f (x + 1). Then 

(*)tv+i =x(x + l) - • • (x + n) 

= x-A(x---(x + n-l)) 
= xA(x) n 

Consequently, if we define T(x n ) = (x) n then 

T(x n ) = (x) n = (xA)(x) lx _ 1 = • •• = (xAHl) 
2 see S I8l8.7| for the motivation. 
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and so we have the useful composition formula 

T(f)=f(xA)(l) (7.7.2) 

For more information about the rising factorial, see § 1818.71 
Sometimes the rising factorial transformation is hidden. The following 
lemma is based on the definition of L found in [3[. 

Lemma 7.36. If g = Y_ Q t xl then define L(f, g) — Y. x l Qi.A 1 f (0) where Ag(x) — 
g(x + l)-g(x). 

Iff G P pos and g G P then L(f, g) G P. 

Proof. Write f = V_ Since A(x) i = and A 1 (x) i = i! we see that 

bi = (A \ f , K0> and hence L(f, g)=Y x l aib t i!. Define T: ( x) t h- > x 1 and set h = 
T(f) = Y_bix\ If h G P pos and g € P then by Theorem |#7| Y x^bji! G P. 
From Corollary [735] we know T: P pos — > P pos , so f G P pos implies h G P pos . 
Thus we are done. □ 

The maps x n i— > (x) n and x n i— * (x) n are conjugates by the map x i— > — x, 
as expressed in the diagram 

palt | Un > palt 



ppos i ppos 

The diagram commutes since 

x- » (-l)-x- » (-l) n (x) n ~ tlU-x) n = (x) n . 
If we compose the falling and rising factorial transformations we get 
Lemma 7.37. 

1. The transformation (x) n (x) n maps p pos — ► pp° s . 

2. T/ze transformation (x) n (x) n maps P alt — ► P alt . 

3. T/ze transformation (x) k i— > (x)^^ maps P pos (n) — ► P pos (n). 

4. The transformation x k i— * (x) n _ k maps P alt (n) — ► P alt (n). 

5. T/ze transformation (x) k (a) k x k maps P pos (n) — ► P vos (n) for <x > 
n-2. 

Proof. The first one follows from the diagram 

ppos : ..".>. ppos 
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and the second is similar. The next follows from the diagram 



ppos 



P pos (n) 

A 

k 



ppos (n) reverse ; pp0 s (n] 



The next two use the diagrams 





ppos (n 



Given any linear transformation T, the map T(x l ) i— > T(x 1+ ) usually does 
not preserve roots. The only non-trivial cases known are when T(x n ) = [x) n , 
(x) n , or H n (Corollary Ell. 

Lemma 7.38. T/ze Zz'zzear transformations (x) i i— > (x) i+1 and (x) i t— * (x) i+1 map 
P P. 

Proof. If T(f ) = xf and S(f ) = f (x + 1) then the fact that the diagram 

fx- 11. h-^(xL 



commutes shows that the desired transformation is TS _1 . Now both S _1 and 
T map P to itself, and so the same is true for TS _1 . Incidentally this shows 
T(x n ) = x(x — l) n . The proof for (x) i is similar. 

□ 

Lemma 7.39. LetT(x n ) = (x) n /n!. 

J . ppos > ppos 

J . p(l,°°) j. p(l,oo) 

Proof. We have the identity 

(T(f))(l-x)=T(f(l-x)) 
which is proved by taking f (x) = x k , and using 

(i_x) k = y (x)t A 

k! 2— i! U 

i=0 v 



CHAPTER 7. ROOT PRESERVING LINEAR TRANSFORMATIONS 186 



Now we know that x n i— ► (x) n maps P pos — ► pp° s ^ anc [ U p n application 
of the exponential map, so does T. The identity above implies that we have 
the following commuting diagram 



p(— oo,0) 1 p(— oo,0) 



p(l,oo) > p(l,oo) 

□ 

Remark 7 AO. T above does not P 10,1 ' to P. Indeed, T(x — 1/2) 2 has complex 
roots. It is known [26] that T maps P 1+R to itself. See also Question l59l 

If we need to prove a result about polynomials without all real roots, then 
sign interlacing is useful. Consider 

Lemma 7.41. If g has p positive roots, and <x is positive, then 
h = (x — oc)g — xg' has at least p + 1 positive roots. 

Proof. Suppose that the positive roots of g are T\ < ■ ■ ■ < r v . The sign of h(rt) 
is the sign of — g'(ti) and so is (— l) p+1+1 . This shows that h has at least p 
roots, one between each positive root of g, and one to the right of the largest 
root of g. The sign of h(0) is the sign of — ah(ti). Since a is positive, there is a 
root between and n, for a total of at p + 1 roots. □ 

Lemma 7.42. If T _1 : (x) n x n , and f e P has p positive roots, then T _1 (f) has 
at least p positive roots. 

Proof. We prove this by induction on the number p of positive roots of f . There 
is nothing to prove if p = 0. Let g(x) = T _1 (f). By induction we assume that 
g has p positive roots. If h(x) = T _1 (x — <x)f where a is positive, then the 
recurrence (|7. 7.1| | shows that h = (x — a)g — xg'. Lemma [7.411 shows that h 
has at least p + 1 positive roots. □ 

7.8 Hermite polynomials 

The Hermite polynomials are the orthogonal polynomials for the weight func- 
tion e~* 2 on (—00,00). There is an explicit formula (the Rodrigues' formula) 
for the Hermite polynomials [168, page 25]: 



H n = (-l) n e^ [ A I (7.8.1) 



an expansion m a series 

tt/2 

(n-2k)! 



dx 



Hn^I , n ?vl , (-l) k (2x) n - 2k (7.8.2) 



k=0 
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and a recurrence relation 

H n =2xH n _!-2nH n _ 2 (7.8.3) 
An addition formula for Hermite polynomials is 

H n (x + u) =jr_ (yWx)(2y) n - k (7-8.4) 

k=0 ^ ' 

The Hermite polynomials are not of Meixner type since they aren't monic, 
but if we rescale them to make them monic then we get a recursion of the 
form Pn+i = *Pn — Tipn-i which is Meixner with parameters (a, a, b, (3) = 
(0, 0, 1, 0). From the recurrence (|7.8.3b it is easy to verify that = 2nH n _i. 
There is also a differential recurrence 

H n+1 = 2xH n - (H n )' = (2x - D)H n (7.8.5) 

and consequently 

H n = (2x-D) n (l) (7.8.6) 
If we set T(x T1 ) = H n then we have the composition 

T(f) =f(2x-D)(l) (7.8.7) 
It is easy to verify that 

T-!(f) =f(x/2+D)(l) (7.8.8) 

We need to be careful here because multiplication by x does not commute 
with D. We define (2x - D) n (l) inductively: 

(2x-D) n (l) = (2x-D) ((2x-D) n_1 (l)) 
and for f = Y_ Q t xl we use linearity: 

f(2x-D)(l)=^a i (2x-D) i (l) 

In this light the following lemma establishes an interesting relation be- 
tween these two definitions. The proof is by induction and omitted. 

Lemma 7.43. Zff(x) = (ax+bD) n (l) then 

f(cx+dD)(l) = (acx+(ad + b/c)D) n (l) 

The linear recurrences satisfied by the transformation T(x n ) = H n follow 
from d7.8.3t , and are 

T(xf) =2xT(f)-T(f) (7.8.9) 
T- : (xf) = (l^xT-^f) + (T _1 f)' (7.8.10) 
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Corollary 7.44. Let H n be the Hermite polynomial. 

1. IfT(x n ) = H n then T: P — ► P. 

2. The linear transformation x 1 1— > H n _t maps P[n) to itself. 

3. The linear transformation Ht i— > Hi+i maps P(n) to ztoe/f. 

Proof. For the first part apply Corollary 17.251 to (|7.8.7b . The next map is the 
conjugate of T by the reversal map - see § 1717.21 The map Hi i— > Ht + i is equiv- 
alent to f i — ^ (2x - D)f by (|7.8.5) . and this maps P to itself by Lemma [L20l As 
an aside, the map H n — > H n+ i evaluated at x n is x n_1 (2x 2 — n). □ 



There are also generalized Hermite polynomials B1461 page 87] that are 
defined for any positive y. They satisfy Hg = 1 and 

H; +1 =xH;-ttvHX_ 1 

The corresponding linear transformation preserves roots just as the ordinary 
Hermite transformation does. 

Different representations of orthogonal polynomials can lead to different 
proofs of Corollary 17.441 For instance, using J7.8.2b the Hermite polynomials 
can be defined in terms of the derivative operator. \57\ 

H n =2 n exp(-D 2 /4)(x n ) (7.8.11) 

The transformation T: x n — > H n can be factored as T1T2 where Ti(x n ) = 
(2x) n and T2(x n ) = exp(— D 2 /4)(x n ). Since exp(— x 2 /4) is in P, we can apply 
Theorem |5.36l and conclude that T2(f ) € P. Thus, T maps P to itself. 

The action of T: x n — > H n (x) on sin and cos is surprisingly simple. It's 
easiest to compute T on e lx . 

T(e«) = ^H 1e (x)- 

k=0 

= e- 2lx+1 (from TableUSS 
= e(cos(2x) — tsin(2x)) 
= T(cos(x) + isin(x)) 

and so we conclude that 



T(cos(x)) = ecos(2x) 
T(sin(x)) = — esin(2x) 
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7.9 Charlier polynomials 

The monic Charlier polynomials with parameter oc are orthogonal polynomi- 
als that satisfy the recurrence formula 

C^ +1 (x) = (x-n-a)C«(x)-anC«_ 1 (x) (7.9.1) 

The first five Charlier polynomials are in Appendix l24.9l Their recurrence 
shows them to be of Meixner type, with parameters (a, a, b, (3) = (a, 1, oc, 0). 
They satisfy the difference equation (analogous to the differential equation for 
the Hermite polynomials) 

AC^(x)=nC*_ 1 (x) (7.9.2) 

where A(f ) = f (x + 1) — f (x). Consequently the Charlier polynomials give an 
infinite sequence of polynomials with all real roots that is closed under dif- 
ferences, just as the infinite sequence of Hermite polynomials is closed under 
differentiation. 

The corresponding linear transformation is T(x n ) = C£(x). T satisfies the 
recurrences 

Tfxf) = (x- a)Tf - T(xf') - aT(f') (7.9.3) 
T- X (xf) = (x+ aJT-^f) + x(T- : f)' + a^f)' 
T- 1 (f)=f((x+a)(D + l))(l) 

Corollary 7.45. If T(x T1 ) = C£(x), the n-th Charlier polynomial, then T maps P alt 
to itself. J- 1 maps P pos to itself. 

Proof. If we set A(f ) = xf and D(f ) = f, then the Charlier polynomials satisfy 

T(xf) = (x - a)T(f) - TA(f ) - aTD(f) 
T-!(xf) = (x + a)T- : (f) + AT-^f) + aDT^f) 

Since A(f ) <C f and f < Df the results follow from Theorem l6.36l □ 

Since T _1 maps P pos to itself, we know that B n+ i ~ B n . Using the identity 

B n+ i =x(B n +B;) 

we see that actually B n+ i < B n . (See p.[724l) 

Remark 7.46. The Charlier polynomials satisfy a formula that is analogous to 
the composition formula for Hermite polynomials. This will be used in § 1818.61 
If we define A -1 f (x) = f (x — 1), then the transformation T above satisfies 

T(x n ) = C£(x) 

= (xA- 1 - oc) C£_! (x) by (Z2Q) and (17^21 
= (xA _1 - a) n (l) by induction 
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and so 

T(f) =f(xA- 1 -a)(l) (7.9.4) 
7.10 Laguerre polynomials 

As with so many special functions there are variations in the definition of the 
Laguerre polynomials. The usual notation is L£(x); we will see that the a is 
quite important, and so we also write L n (x;y) — Lu(— x). The definition of 
that we use is in [72J, but we set the leading coefficient of L n (x;u) to 1/n!. The 
Rodrigures formula is 

l£[ x ) = —}—±D n (e-*x n+a ) (7.10.1) 
x a e x n! 

x^ e x n! 



and a series expansion is 



k=0 



u-k / k! 



The Hermite polynomials | |7.8. lib and the Laguerre polynomials have nice 
representations in terms of the exponential of a quadratic differential. 

e «a x 3 y ( x ^) = n , a n Ln (Z^) (7.10.3) 

a 

The homogenized Laguerre polynomials have an expansion 

(y + l^Lnt-L-) = V" L k (x) (f)y n - lc (7.10.4) 

The expansion l|7.10.2| l shows that L n (x; y ) is a polynomial in x and y with 
positive coefficients. 

An alternative definition is found in |146| with leading coefficient (— l) n . 
We denote these alternative polynomials by L"(x), and note that we have the 
simple relationship L"(x) = n!L£(x). The three term recurrence is not useful 
for us, but we will use the composition formula 111461 page 110] 

V n a) M = (xD-x+a+l)L£ +1 

= (xD - x + a + 1 ) (xD - x + a + 2) ■ • • (xD - x + a + n) (1 ) 

= ( xD-x+a+lU l). (7.10.5) 
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Corollary 7.47. If T is the Laguerre transformation T(x n ) = (x) where oc ^ — 1 
thenT:P v ° s — ► P alt . 

Proo/. If we define S(f ) = (xD — x+ a+ 1 ) (f ) then from d"7. 10.5b the composition 
of maps 

* n - <*) n -» (S) n (l) 

is the Laguerre transformation. If U(x n ) = (x) n and V(f) = f(S)(l), then 
T = VU. From Corollary (735] U maps P pos to P pos . By Corollary EZ1 V maps 
P pos to P. Consequently VU maps P pos to P. Since the coefficients of T(f ) are 
alternating in sign, we see that T maps to P alt . □ 

If we define T(x n ) = L£(-x), and S(x n ) = L n (x; a) then S = T o EXP. It 
follows that S maps P pos to itself. We will see in Lemma [15. 41l that S actually 
maps P to itself. 

The Laguerre polynomials are also related to the linear transformation 

R:f h-> D n (x n f) 

where n is a fixed positive integer, whose action on monomials is R(x x ) = 
( n + i ) ■ x l . The exponential generating function corresponding to R is 

£^=I^ = eH n (-x). (7.10.6) 

i=0 ' 1=0 

Since both e x and L n (— x) are in p pos , so is their product. The fact that the 

generating function of R is in P pos also follows from Theorem [1522] since R is 
a multiplier transformation that maps P pos to itself. 

7.11 The integration and exponential transformations 

The exponential transformation x n i— > x n /n! is a fundamental linear transfor- 
mation that maps P to P. Although not all polynomials in P have integrals 
that are in P (see Example 11.21 we can find some polynomials for which all 
integrals have a root. These polynomials are exactly the image of the expo- 
nential transformation. We first define the integral (J) and exponential (EXP) 
transformations. Define 

exp : x n i— >x n /n! 
J :x n ^x n+1 /n+l 
f :x^x- +k /(n±l) k 
J k exp :xV^x n+k /(n + k)! 

Since e x e P, we can apply Theorem ll5.22l to conclude that EXP maps P to 
itself. The integral doesn't preserve all real roots as Example 11.21 shows, but 
we do have 
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Lemma 7.48. J/f is a polynomial with all real roots, and for every k ^ we define 
T(f ) = J k EXP f then 

1. T maps P to P. 

2. -\mapsP vos toP v ° s . 

3. T maps P alt to P alt . 

4. T preserves interlacing. 

Proof. It follows from the definition of J k EXP that 

J k EXPf = e x *(x k f) 

where "*" is the Hadamard product. The results now follow from the fact 
that e x e P pos and Hadamard products preserve interlacing. □ 

When we apply various exponential transformations, we get some inter- 
esting transformations that we express in terms of coefficients: 

Corollary 7.49. If f = Xa=o a i %x * s zn P then the following polynomials also have 
all real roots: 

n 

2_ i! X 

i=0 
n 



at 



5.^ 



inji nix 

t=0 

Proof. The first is EXP applied to f, the second is EXP applied to the reverse, 
and the third is EXP applied to the second. Multiplying the third by n! gives 
the fourth, and the fifth is n! times the second. □ 

We can prove a more general exponential result using the gamma function: 

Lemma 7.50. For any positive integer k the transformations x 1 jw^ %x an & xl ^ 
jljyy map P to itself. 
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Proof. In Example 15.81 we saw that V(z + 1)/T(kz + 1) is in P, and has all 
negative roots. The result now follows from Corollary 17.201 and ([7.4.2b since 

r(k + i)/r(ki + i) =i!/(ici)i. □ 

It is not an accident that EXP occurs along with J". 

rk 

Theorem 7.51. If f is a polynomial such that J f has all real roots for infinitely 
many positive integers k, then there is a polynomial g with all real roots such that 
f = EXP(g). 



Proof. If we write f (x) = ^ cu — then 



i=0 

A+lc 



i=0 



The polynomial on the right hand side of (|7. 11.1) has all real roots. Replace x 
by kx: 

A (kx) i+k _ k k x k ^- k k k 

^- ai Ti + k)! " ~kT 2- Qi kTT ' ' ' kTT x 

i=0 v ; i=0 



so the n-th degree polynomial 



Lk k 
. ai kTT ' ' ' kTi* 

1=0 



has all real roots. Taking the limit as k — * oo we see that atx x has all real 



i=0 



roots. This last polynomial is the desired g. □ 

The Laplace transform, when restricted to polynomials, is closely related 
to the exponential transformation. If f (t) is any function then the Laplace 
transform of f (t) is 



£(f)(x) 



e- xt f (t) dt. 

o 



If x is positive then Z(t n )(x) — -Sty- We say that the Laplace transform 
£(f) of a polynomial f = Y_ Q i xl is Y. "i^TT' Consequently, we can express 
£(f) in terms of EXP: 

£(f)(x)=-EXp- 1 (f)(l) 

X X 

since EXP _1 (x n )(i) = If f is a polynomial of degree n and £(f)(x) has 
n real roots, then is not a root, so EXP _1 (f)(i) has all real roots. Taking 
the reverse shows that EXP _1 (f)(x) has all real roots, and hence applying EXP 
yields that f 6 P. Summarizing, 

Lemma7.52. If i is a polynomial of degree n and £(f) has ureal roots then f e P(n). 
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7.12 Binomial transformations 

In this section we study the transformation T : x n i— > ( x ) and its inverse. The 
many identities satisfied by binomial coefficients yield recurrences for these 
transformations . 

Corollary 7.53. The linear transformation x 1 1— > ( x ) maps 



p(0,oo) ¥ p(0,oo) 

p(— 00/— i) > p(— oo,— 1) 

Proof. The map T is the composition x n — ► x n /n! — ► (*) where the second 
map is the falling factorial (see Proposition ^. 311 and the first is the exponential 
map. Since the falling factorial maps P alt to itself we have proved the first 
part. For the second one we use the identity AT = TA where Af (x) = f (— x— 1 ) . 
This yields the communicative diagram 

p(0,oo) 1 >. p(0,oo) 



p(— oo,— 1) a> p(— oo,— 1) 

since we know the top row, and A -1 = A. 
Corollary 7.54. ThemapHx 1 ) = ( x+ £ _i ) mapsP m] toP aU . 



□ 



Proof. This follows from Lemma 16.401 and the diagram following. See 
Lemma [830] for a more general result. □ 

It is not the case that T maps P~ 1,0 to itself - even T(x + 1 /2) 2 has imaginary 
roots. We are now going to show that T _1 maps P~ 1,0 to itself. 
We use a simple binomial identity to get a recurrence for T~ : . 



T c( * j . = (n ! l)x n 1 nx i: 

x • x n + x(x + 1) • nx 



n-l 



x r 1 ! J+x(x+i) r 1 



By linearity it follows that 

T- : (xf) =xT~ J f + x(x + l)(T- 1 f)' 
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Lemma 7.55. The map x n i— ► ( x ) satisfies T 1 : P l 1,0) — ► P ( 1,0) and preserves 
interlacing. 

Proof. If T -1 f £ F [ - h0] then T _1 f <. (T -1 f)' and so it follows that x(x + 
l)T _1 f <jT _1 f since all roots lie in (—1,0). Using the recurrence yields 
T _1 (xf) <?Tf and we can follow the argument of Corollary 11.461 to finish the 
proof. □ 

The behavior of T _1 at the endpoints of (—1, 0) is unusual: 

Lemma 7.56. T-^x™-) <T _1 (x + Infact, 

(x + l)T- 1 (x rt ) =xT" 1 (x + l) n 

Proof. If we define the affine transformation Bf(x) = f(x + 1) then we will 
prove the more general result (x + 1 )T _1 f = xT _1 Bf . It suffice to prove this for 
a basis, so choose f = (*) . The computation below establishes the lemma. 

* T -'»< = « T ^HiTHXn) + (n - l) > 
= x(x n + x^" 1 ) = (x + l)x n = (x + l)T _1 f 

□ 

Corollary 7.57. The map T : x n i— * (*) satisfies 
f 1 

T" 1 (x + t) n dteP for n = 1,2,-" . 

- o 

Proof. We can apply Proposition l3.37l once we show that T _1 satisfies T _1 (x + 
a) n «T -1 (x + b) n if0<b<a<l. We show more generally that if the roots 
of g are all at least as large as the corresponding roots of f then T -1 q <C T -1 f. 
Since T _1 preserves interlacing we know that there is a sequence of interlacing 
polynomials from T x n to T~ : (x + l) n that includes f and g. Since the end- 
points of this sequence interlace by the preceding lemma, we have a mutually 
interlacing sequence, and therefore T _1 g <§; T~ : f. □ 

We now consider the transformation T: x n i— > (x) n /u!. Since (— x) = 
(— l) n (^) we can apply the results of this section to conclude 

Lemma 7.58. The map T: x n i— > (x) u /n! satisfies 

1 J. p(-oo,0) y p(l,oo) > p 

2. T- 1 : P m) ^ P. 

If Af (x) = f (1 — x) then (x — l/2) k is invariant up to sign under T and 
its roots are invariant under A. The following shows that there are complex 
roots, and they all have real part 1/2. 
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Lemma 7.59. If T: x n i— > (x) n /u! then all roots ofl(x — l/2) n have real part 1/2. 
If n > 1 t/ze« t/iere z's a complex root. 

Proof. If we define 

S(f(x)) = (T(x-l/2))(x + l/2) 
then S satisfies the recurrence 

S(x n+1 ) = — ^-S(x n ) + - U ^ Stx"' 1 ) 
n + 1 4(n + l) 



Since S(l) = 1 and S(x) = x, it follows that S(x n ) has purely imaginary roots. 
The first part now follows from S(x n ) (x - 1/2) = T(x - l/2) n . 

If T(x — 1 /2) n had all real roots then they would all be 1 /2, but it is easy to 
check that T(x - l/2) n is not a multiple of (x - l/2) n . □ 

7.13 Eulerian polynomials 

The Eulerian polynomials can be defined as the numerator of the sum of an 
infinite series. If r is a non-negative integer, then the Eulerian polynomial 
A T (x) is the unique polynomial of degree r such that 

If we differentiate (17.13.11 and multiply by x, we get 

(r + l)A r + (l-x)A' r 



and so 



i=0 1 ' 



Aq = 1 At = x 

A r = x ( rA r _i + (1 - x)A;_ a ) (7.13.2) 



A n has all non-negative coefficients and Corollary 17.291 shows that the 
roots of A r are all non-positive, and A T < A T -i. Using (|7.13.2l l, we can write 
the recurrence for the transformation T: x n i— > A n : 

T(xf ) = xTf + xT(xf') + x(l - x) (Tf )' (7.13.3) 
There is a Mobius transformation that simplifies this recurrence. 
Lemma 7.60. If Mz = ^±1 then T M : P l<u) ^ p'- 1 - ' where T M is defined in 
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Proof. Let B n = x n A n (^±I). A bit of calculation using H7.13.2t shows that 
Bo = 1 and 

B n+ i =(x + l)(B n + xB^). 
This leads to the recurrence for Tm : 

T M (xf) = (x + l)(T M (f)+xT M (f)') 
The conclusion now follows easily by induction using this recurrence. 

□ 

Corollary 7.61. IfT: x n h-> A n then T maps P^ 1 '^ to P v ° s . 

Proof. If V is a linear transformation and M is a Mobius transformation then 
write (M)oV = Vm- Setting S = T (z+1) / Z then with this notation we have that 

1 

T = (z- 1) o [-] oT (z+1)/z 

since the composition (z — 1) o (1/z) o ((z + l)/z) is the identity. We now ap- 
ply Lemma [7.601 and the general results of § 1616.71 to deduce the commutative 
diagram 

p(0,l) I ^ pP°s 

S (z-1) 
p(-l,0) — >■ p(-oo-l) 

□ 

It appears that the roots of A n go to minus infinity and hence the roots of 
polynomials in the image of p' -1 - ' are not contained in any finite interval. 
If f is a polynomial of degree n, then we define the transformation W by 



If f = Y.i bt* 1 then 



Wf = ^b i (l-x) u - i A i 



This shows that Wf is a polynomial of degree n. W can be realized as a com- 
position with T and a homogeneous transformation 

W : i — > A]< i — > (1 — x) n - k A k . 

Corollary 7.62. The map W defines a linear transformation from p (_1 ' 0) to P. 

Proof. If f e P (_1 ' 0) then Tf e P alt where T is given in Corollary 17.611 Since 
W(x n ) =A„£ P pos has leading coefficient 1 we can apply Lemma IT2.27I to 
conclude that Wf € P. □ 
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7.14 Euler-Frobenius polynomials 

The Euler-Frobenius polynomials Il5lll arise in interpolation problems. These 
polynomials satisfy the recurrence 

P n+1 = 2xP n -(l-x 2 )P n (7.14.1) 
= (2x+(x 2 -l)D)P n 
= D((x 2 -l)P n ) 

If we define the linear transformations T(x n ) = P n and S(f) = D ((x 2 — l)f) 
then 

T(x n ) =S n (l) 
and for any polynomial f we have 

T(f)=f(S)(l) 

By Corollary 17.301 the transformation 2x + (x 2 — 1)D + oc maps p' -1 ' 1 ' to 
itself for | a| < 1 so we conclude 

Lemma 7.63. //T(x n ) = P n then T maps p'- 1 ' 1 ' to itself. 

There is a modification of the Euler-Frobenius polynomials that are also 
called Euler-Frobenius polynomials [183[. They are defined by applying a 
Mobius transformation to P n : 

En(x) = (x-l) n P n (^) 

These polynomials satisfy the recurrence 

E n = (1 + nx)E n _! + x(l - x)E n _! (7.14.2) 

It is not obvious that all the coefficients of E^ are positive, but this can be 
shown H183I by explicitly determining the coefficients. The recurrence J7.14.2b 
then shows that all the roots of E n are negative, and that E n < E n _i. We can 
use the fact that P n = S n (l) to derive a "Rodrigues' formula" for E n : 

x (1 — x) z 

7.15 Even and odd parts 

When does a polynomial with all real coefficients have roots whose real parts 
are all negative? The answer is given by Hurwitz: write a polynomial f in 
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terms of its even and odd parts. The polynomial f has all roots with negative 
real part iff the polynomials corresponding to the even and odd parts inter- 
lace. See CM V171.4] or Proposition|223U 

In this section we generalize the transformations that assign either the even 
or odd part to a polynomial. Under appropriate assumptions, these transfor- 
mations preserve roots and form mutually interlacing sequences. 

Write f(x) = f e (x 2 ) + xf (x 2 ). The even part of f is f e (x), and the odd 
part is f (x) so we define two linear transformations by T e (f) = f e (x) and 
T (f ) = f o (x) (see Page[84|. It is clear that T e and T D are linear, and satisfy the 
recurrences 



T e (xf)=xT (f) (7.15.1) 
T (xf)=T e (f). 

Theorem 7.64 (Hurwitz's theorem). Ifie P v ° s then T e (f ) and T Q (f ) are in P pos . 
In addition, xT Q (f) is interlaced by T e (f). Thus, 



ifdeg(i) is 



even then T e (f ) < T Q (f ) 
odd f/ienT e (f)<T (f). 



We will derive this theorem from the following more general result. If f (x) 
is a polynomial and d is a positive integer, then we can uniquely write 

f(x) = f (x d )+xfi(x d ) + ---+x d - 1 f d _ 1 (x d ) (7.15.2) 

Theorem 7.65. If f S P pos , d is a positive integer, and fo, . . .,fd-i are given in 
117.15. 21 then fo, f i, . . . , f d-i is a mutually interlacing sequence. In particular, all ft 
areinP v ° s . 

Proof. The idea is to use a certain matrix H such that f (H) contains all the fi's 
as entries. We then show that f (H) preserves mutually interlacing sequences. 
This yields ft S P pos , and the mutual interlacing of the sequence. 
Assume that f is monic, and write 

n n 

f(x) =^ ai x l = n(x + n) 

1=0 i=l 

where all r\ and a\ are positive. Define the d by d matrix H as follows. The 
upper right corner is x, the lower diagonal is all l's, and the remaining entries 
are zero. For instance, if d is three then H, H 2 , . . . , H 6 are 




X 2 











X 2 











X 2 
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If I is the identity matrix then 

n 



i=l 



i=0 





Xf d-1 Xf d -2 • • • 


xfi \ 


fl 


fo xf d _l . . . 


xf 2 


f 2 


fl 








Xfd-l 


Vd-i 


fd-2 ••• fl 


fo / 



An important property of H is that H d = xld, as we can see from the above 
matrices when d = 3. Upon consideration of the patterns in the powers of H 
we see that 



i=0 



where all terms are either of the form f i (x) or xft(x). 

Since (£ curl 1 ) (1, 0, . . . , 0)* = (f 0/ fi, . . ., f d -i) it suffices to show that H+tI 
preserves mutually interlacing sequences. This follows from Example 13.741 

□ 

Remark 7.66. The even part of a polynomial in p pos has all real roots. Is every 
polynomial in P pos the even part of a polynomial in P pos ? We use Newton's 
inequalities to show that the answer is no. 

Suppose that f = f e (x 2 ) + xf D (x 2 ) e P pos (2n), and let f e = L From 
| |4.3. 51 with k = 2 and r = 1 we see that 



4-3 (2n)(2n-l) 



a a 2 2-1 (2n-2)(2n-3) 
Now Newton's inequality for f e is 



>3 2- 



n 



n- 1 



(7.15.3) 



±>2- 



aoa2 n — J. 

Thus, if f e is the even part of a polynomial in P pos , then it must satisfy (17.15.31 
which is stronger than the usual Newton inequality. In particular, (x + l) n is 
not the even part of a polynomial in P pos for n ^ 2. 

Example 7.67. If we begin with a polynomial in P pos (or a stable polynomial, 
see Chapter l2ljl then we can form a tree by recursively splitting into even and 
odd parts. For instance, if we begin with (x + l) 8 we get Figure EU The 2 k 
polynomials in the kth row are mutually interlacing (but not left to right). 

There is an elementary reason why the even and odd parts never have a 
common factor. 



Lemma 7.68. Iff S P pos then f e and f have no common factor. 
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x 8 + 8x 7 + 28x 6 + 56x 5 + 70x 4 + 56x 3 + 28x 2 + 8x + 1 




x 4 + 28x 3 + 70x 2 + 28x + 1 8x 3 + 56x 2 + 56x + 8 




x 2 + 70x+l 28x + 28 8x + 56 56x + 8 



Figure 7.1: The even-odd tree of (x + l) 8 

Proof. Assume that they do. If f e = (x + a)g(x) and f = (x + a)h(x) then 

f(x) =f e (x 2 )+xf (x 2 ) = (x 2 + a)(g(x 2 )+xh(x 2 )) 

This is not possible, since f £ P pos implies that a is positive, and x 2 + a ^ P. 

□ 

7.16 Chebyshev and Jacobi Polynomials 

The Chebyshev polynomials T n are orthogonal polynomials given by the re- 
currence formula T = l,Ti = x, and T n+ i = 2xT n — T n _i. We study some 
Mobius transformations associated to the Chebyshev and Jacobi polynomials. 

The transformations associated to the reversal of the Chebyshev and Leg- 
endre polynomials satisfy some simply stated identities. We can use these to 
establish mapping properties for the reversal of Chebyshev polynomials. 

For our purposes we need an explicit formula: 

T n (x) = 2~ n f_ ( 2 r) (* + D l (* - D n "\ (7.16.1) 

i=0 ^ ' 

Lemma 7.69. If T is the linear transformation x n T n and Mz = |^ then 
T M : P m] -» P pos . 
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Proof. Using l|7.16.1l l we find a simple expression for Tm (x n ) : 




where the even part is defined in § 1717.151 Consequently 
T M (f ) = even part of f ( (1 + x) 2 ). 

Iff 6P 10 ' 11 then f((l + x) 2 ) is in P ( ' ' and by Theorem l7.64l the even part 
is in P pos as well. Thus T M maps P (ai) to P pos . □ 

Formulas analogous to 117.16. 1) hold for all Jacobi polynomials. From 
(1681(4.3.2)] 




If we apply the Mobius transformation M of Lemma 17.691 to the linear trans- 
formation J : x n P"' 13 we get 




(7.16.2) 



In particular, the Legendre polynomials are so defining L(x n ) = P^° we 
have the elegant formula 




(7.16.3) 



It follows from Corollary 1 1 3 . 1 71 that 

Lemma 7.70. The map x n i— > Lm(x u ) given in i7.16.3|l maps P alt to itself. 

We now state two identities for reverse polynomials, where T n and P n are 
the Chebyshev and Legendre polynomials respectively. 
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T(x n ) = (T n 



T(l-x) n = 







fl _ X 2W2 



n odd 



n even 



(7.16.4) 



S(x r 



S(l-x) 



2 in/2 



n odd 



n even 



(7.16.5) 



Lemma 7.71. J/T z's £roen akwe, fen T: P aLt U P pos — ► P. 

Proof. The proof relies on the commutative diagram that holds for (x — l) n 
which is a consequence of the identity l|7.16.4|l , and by linearity it holds for all 
polynomials. The map even returns the even part of f . 

T 



XI-fX + 1 



Since we know that the even part determines a map from P pos to itself, the 
claim in the top line of the following diagram follows by commutativity. 



j(-oo,0) 



- P 



x>-»x+l 



xi-^l-x 2 



p(— oo,— 1) 

A similar argument applies for P Q 



j(-oo,0) 



It 



□ 



7.17 Commuting diagrams of spaces 



The earlier commuting diagrams of transformations in § 1616.81 give rise to 
transformations between spaces of polynomials. These are useful to us be- 
cause we generally understand three of the four transformations, and so can 
deduce information about the fourth. 

Example 7.72. T: x n i-> H n satisfies (Corollary I7.44|l 



XH^X+ OCX 



pltt+ctt . 
T ; 



pR+ai/2 ^ p 

xh^x+ocx/2 



Example 7.73. T: x k i-> H k (x)H n _ k (x) satisfies (Corollary 19351 

p(0,l) 1 >■ P 
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Example 7.74. T: x k i— > LiJxJLn-iJx) satisfies (Corollary I9.53|l a similar dia- 
gram as the Hermite transformation above, but the image is smaller. 

T 



(0,1) 



reverse 



,(l,oo) 



(0,oo) 



(0,oo) 



Example 7.75. For the transformation T: x k \— > (x) v ( x — <x ) n _ y the following 
diagram commutes at the function level, but we are not able to prove either 
the top or the bottom. If one were true, then the other follows from the com- 
mutativity. 

p(0,l) 1 s- p(-oo,0) 



reverse 



xh- » oc— x 



,(l,oo) 



j( 0C,Oo) 



Example 7.76. The transformation T: x 1 i— > ( x + d — i ) d actually maps 

palt , psep ( Lemma |gJJ. 

p(0,l) ^ p(0,oo) 



d,oo) 



j(-oo,-l) 



Example 7.77. T(x k ) = &± satisfies (Lemma |7^9l 

p(l,oo) s- J 



xnl-x 



-oo,0) 



- p 



Example 7.78. T: x k Hk(x)x n k acts (Lemma l9.60b on polynomials of de- 
gree n: 

p(2,oo) 1 ^ p 



xi->4— t 



(-oo,2) 



> P 



7.18 The Pincherlet derivative 



If we are given a linear transformation T on polynomials we can form the 
Pincherlet derivative 

T'(f)=T(xf)-xT(f) = [M,T](f) 
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where M(f) = xf, and [M.,T] is the commutator TM — MT. In some circum- 
stances T' will map P to itself. 

Lemma 7.79. If f < g, f has leading coefficient a, g has leading coefficient b, then 
f — xg G P if a ^ b. 

Proof. Immediate from Lemma [1.201 □ 

Corollary 7.80. Suppose T: P — > P maps polynomials with positive leading co- 
efficients to polynomials with positive leading coefficients. If the sequence of leading 
coefficients o/T(x x ) is strictly increasing then V : P — ► P. 

Proof. Choose monic f. Then since xf <f and since T preserves interlacing, 
T(xf) < T(f ). By hypothesis T(xf) and T(f) meet the conditions of the lemma, 
so T(xf) — xT(f) = T'(f) G P. □ 

Corollary 7.81. The linear transformation x n i— > H n+ i — xH n maps P — > P. 

Proof. The leading coefficient of H n is 2 n . □ 

If T maps P pos to itself, then we have weaker assumptions. 

Lemma 7.82. Suppose T: pp° s — > ppos ma p S polynomials with positive leading 
coefficients to polynomials with positive leading coefficients, and preserves degree. 
Then,T:P pos — > P pos . 



Proof. Choose f G P pos . Then T(xf ) < Tf and since T(xf ) G P pos it follows that 
xT(f)<T(xf) and hence T(xf) -xT(f) G P pos . □ 

Corollary 7.83. The linear transformation x n h-> n(x) n maps P pos to itself. 

Proof. If T(x n ) = (x) n thenT'(x n ) = (x) n+1 - x(x) n = n(x) n . Alternatively, 
this is the composition 



differentiation . mult, by x x n ^<x) T 

> nx n - 1 > nx n 



□ 



7.19 Hypergeometric Polynomials 

Hypergeometric series are a natural class of functions that are sometimes 
polynomials. We show that for certain values of the parameters these poly- 
nomials are in P. We begin with the confluent hypergeometric function jFj 
which is defined by 

rf , , „ az a(a + l)z 2 f (a), z l 

We are interested in the case that a is a negative integer, and b is a positive 
integer. Since (a) i is zero if a is a negative integer and i > |a|, it follows 
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that iFi(a,b;z) is a polynomial in these cases, of degree |a|. For example, 
1F1 (— 4, b; z) is equal to 



„ 4z 6z 2 4z 3 z 4 

l--r- + 



F 1 (-d,b;-z) = ^i;(b ± d) i (J) Z 



b b(l + b) b(l + b)(2 + b) b (1 + b) (2 + b) (3 + b) 
and we can manipulate this into a more familiar form 

(b ^ 4) ((b + 4) 4 - 4z(b+4) 3 + 6z 2 (b + 4) 2 - 4z 3 (b + 4) 1 + z 4 (b + 4] 

In general when d a positive integer we can write 

l " -t- U Jj I 

i=0 V 

Applying Lemma [7^21 twice to this representation shows that 
Lemma 7.84. If a is a positive integer, and b is positive then 

iFi(-a,b;-z)eF po » 

There are many relationships involving hypergeometric functions - the one 
below shows that the above result also follows from the fact that the general- 
ized Laguerre polynomial L^ has all real roots for A > — 1. 

^(z) = 7=H ih(-n,l + *,*) 
I (1 + ~vj 

1F1 satisfies recurrences in each parameter: 



iFi(a,b,z) = 2 + 2a — iFi(l + a,b,z) - a + 1 iFi(2 + a,b,z) 
1 + a — b 1 + a — b 

iFi(a,b;z) = iFi(a,b + l;z) + ° , b ^ Z iFi(a,b + 2;z) 

b b(b + l) 

These recurrences can be used to establish this interlacing square for posi- 
tive integral a, and positive b: 

iFi(-(a + l),b;z) iFi(-a,b;y) 

< < 

iFi(-(a+l),b + l;z) < iFi(-a,b + l;y) 
If the hypergeometric series has only one factor in the numerator then we 
can apply the same argument. For instance 
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OO / i ^ 

and if we take d to be a positive integer then 

iF 2 (-d;b,c;-z)= — - — — Y~ f d ) (b + d) ; (c + dU 1 
(b±d) d (c + d) d ^ Vv 

and we can again apply Lemma [7.21 to yield that lTii— d; b, c; z) G P alt . 

7.20 Eigenpolynomials of linear transformations 

If T : P — > P is a linear transformation then we can look for eigenpolynomials 
of T. These are polynomials e for which there is a A e K so that Te = Ae. We 
consider several questions: does T have eigenpolynomials? If so, are they in 
P? Do successive eigenpolynomials interlace? What are explicit examples? 

Lemma 7.85. Suppose T: P — > P maps polynomials of degree n to polynomials of 
degree n. Let c n be the leading coefficient of T(x n ). 

1. If all Cn are distinct then T has a unique eigenpolynomial of degree n with 
eigenvalue c n . 

2. If\c n \ > \c r \for ^ r < n then T has an eigenpolynomial of degree n that is 
in P. 

Proof. If we let V be the vector space of all polynomials of degree at most n 
then the assumption on T implies that T maps V to itself. In addition, the 
matrix representation M of T is upper triangular. The r-th diagonal element 
of M is the coefficient c T of x r in T(x T ). If they are all distinct then T has an 
eigenvector corresponding to each eigenvalue. 

We now use the power method to find the eigenvalue. Choose any initial 
value vq and define 



Vk+l 




where |viJ is any vector norm. Since M has a dominant eigenvalue, the 
converge to an eigenvector v of M, provided that vo is not orthogonal to v. 

Recasting this in terms of polynomials, we choose an initial polynomial 
po in P(n). Since T maps P to itself, all the pk are in P. If this happens to 
converge to zero, then po was orthogonal to p. Simply perturb po, and we get 
an eigenpolynomial in P. 

□ 

The sequence of possible eigenvalues is quite restricted. 
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Corollary 7.86. Suppose that T : P — ► P preserves degree and has positive eigen- 
values A n /or n = 0, 1, Then 

oo , 

y ^ g p. 

z — n 

n=0 

Proof. The A n are the leading coefficients of T(x n ). Apply Lemma [15.331 □ 

Lemma 7.87. Suppose that T is a linear transformation P — ► P, p n and p n +i ate 
eigenpolynomials of degree n and n + 1 with eigenvalues A n and A n+ j. If |A n+ i > 
|An.| and f/ze roots of p n are distinct then p n +i < Pn- 

Proof. We show that g(x) = p n + ap n+ i G P for any a. If we define 
g k (x) = A~ k + aA^jPn+i 

then T k g k = g so it suffices to show that g k G P. Now 

9kM = A~ k (p n + (A n /A n+1 ) k p n+1 ) 

Since |A n /A n+ i| < 1 we can apply Lemma fl. 11 I f or k sufficiently large since the 
roots of p n are distinct. It follows that g k G P, and hence g G P. 

□ 

Remark 7.88. T(f ) = f — f " is a simple example of a linear transformation 
mapping P — ► P that has only two eigenpolynomials. If T(f ) = Af then 
clearly A = 1 and f " = 0, so f has degree or 1. The matrix representing T has 
all l's on the diagonal, so the lemma doesn't apply. 

We next show that multiplier transformations generally have no interest- 
ing eigenpolynomials, and then we determine the eigenpolynomials for the 
Hermite transformation. 

Lemma 7.89. IfT{x n ) — t n x n where all the ti are distinct and non-zero, then the 
only eigenpolynomials are multiples ofx 1 . 

Proof. If Te = Ae where e = Y. a i xl then at = Aticu. Every non-zero at 
uniquely determines A = 1/ti, so there is exactly one non-zero at. □ 

Some simple linear transformations that map P to itself have no eigenpoly- 
nomials. For instance, consider Tf = f + af. If f + af = Af then since the 
degree of f is less than the degree of f we see A = 1, and hence a — 0. 

The eigenpolynomials of the Hermite transformation T(x n ) = H n are 
given by a composition, and are also orthogonal polynomials. 

Lemma 7.90. If T(x n ) = H n then the eigenpolynomials ofT are given by the com- 
position 

g n = (3x-2D) n (l) 
and the corresponding eigenvalue is 2 n . 
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Proof. Since T(f ) = f (2x — D)(l) we can apply Lemma [7,43l to conclude that 

T(g u ) = g u (2x-D)(l) 
= (6x-4D) n (l) 

□ 

It isn't known how to determine if a linear transformation T has all its 
eigenpolynomials in P. If T eventually maps into P then all the eigenpolyno- 
mials are in P. 

Lemma 7.91. If T is a linear transformation such that for any polynomial f there is 
an integer n such that T n f e P, then all eigenpolynomials corresponding to non-zero 
eigenvalues are in P. 

Proof. If Tf = Af then choose n as in the hypothesis and note that T n f = A n f. 
If A is non-zero it follows that f e P. □ 



7.21 Classical multiple orthogonal polynomials 

It is possible to construct polynomials that are orthogonal to several weight 
functions [170J. The resulting polynomials are called multiple orthogonal poly- 
nomials and all have all real roots. In this section we look at the corresponding 
differential operators, and also observe some interlacing properties. 

Lemma 7.92. 

1. Suppose a > — 1. The operator T n/0l : f i— ► x~ a D n x n+a f maps P — ► P and 

pvos £ ppos 

2. Iff e P ± then T n+1 , a f <T n , a f. 

3. Suppose b > 0. The operator S n/ b, c : f | — * e bx2 ~ cx D n e ~ bx2+cx f maps P — > 
P. 

Proof. The first one follows from a more general result (Proposition 1 1 1 .1401 in 
several variables. To check interlacing we note that 

P W + T n+1 ,«f = x- a D n ((3x a+n f + Dx n+1+a f) 

= x- a D rt x rt+a (((3 + n + a + l)f + xf) 

Since f € P ± we know that h(x) = ((3 + n + a + l)f + xf e P which implies 
that 

P W + T n+1 ,«f = x- a D rt x n+a h 

where h e P. By the first part this is in P for all (3 and so the interlacing 
follows. 
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For the Hermite case it suffices to show that e bx2 cx D(e b * 2 + cx f) is in P 
if f is in P. Note that we know that D(e~ bx2+Cx f) is in P and has the form 



bx +cxg^ we can ' t multiply by e bx cx since e bx is not in P. We just 

e bx2-cx D(e -bx2 + cx f) = (f , _ (2bx _ c)f) 



e 

directly compute 



and f - (2bx - c)f is in P by Lemma[L2Q] 



□ 



If f = 1 in (1) then we have Laguerre polynomials, and if f = 1, c = 0, b = 1 
in (2) we have Hermite polynomials. 

If we iterate these operators we get, up to a constant factor, the classical 
multiple orthogonal polynomials of 11701 . Let n = (ui, ...,n r ) be positive 
integers, and a = (cto, . . . , a T ) where ai, . . . , a T are all at least — 1. The fol- 
lowing polynomials are all in P. The interlacing property above shows that 
the Jacob-Pineiro polynomials P" and P^ interlace if n and m are equal in all 
coordinates, and differ by one in the remaining coordinate. 



Jacobi-Pineiro 
Laguerre-I 
Laguerre-II 
Hermite 



n 



dx r 



(1 



nixo+tiH hn T 



n 



i=l 



dx^ 



n 



mi 



_— 0Ci x 



dx n i 



n 

i=l 



1=1 

gbx 2 — cxqti g— bx 2 + cx 



ao+n-!- 



7.22 Laurent Orthogonal polynomials 

Laurent orthogonal polynomials satisfy the recurrence 

P-i =0 
Po = 1 

p n = (a n x + b n )p n _! - c n xp n _ 2 forn^l 

where all a n , b n , c n are positive. There are three simple recurrences that ap- 
pear to have many mapping properties (see Appendix 124.91 . We are able to 
prove some mapping properties for two of them. Notice that the recurrence 
implies that the coefficients are alternating, and hence if the roots are all real 
they are all positive. 

Lemma 7.93. If p n is the sequence of Laurent orthogonal polynomials arising from 
the recurrence Pn+i = x Pn— n x p n _i then the transformation T : x n i— ► p n satisfies 
T : P alt — > P alt and preserves interlacing. 

The transformation T: x n i— ► p n ev maps P aU to P alt . 
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Proof. Using linearity 

T(x • x n ) = xp n - nxp n _! 

= xT(x n )-xT(x n ') 
T(xf) = xT(f-f']. 

We now prove the result by induction - the base case is easy and omitted. 
Since f — f'<Cf it follows by induction that T ( f — f ' ) <C T ( f ) . Since the roots are 
positive by induction, we know that 

T(xf) =xT(f-f')<T(f) 

and the conclusion follows from Corollary II. 461 

For the second part, we have the recurrence T(xf) = T(f)— xT(f '). We can't 
apply Corollary 11.461 since T(f ) does not have positive leading coefficients. 
However, if we let S(f ) = T(f ) (— x) then we can apply the lemma to S. Since S 
satisfies the recurrence 

S(xf) =S(f) +xS(f) 

we see that by induction 

S(xf) =S(f)+xS(f)<S(f) 

andsoS:P Qlt — > P pos . □ 

Lemma 7.94. If p n zs the sequence of Laurent orthogonal polynomials satisfying the 
recurrence p n +i = (x + n)p n — nxp n _i then the transformation T: x n \— > p n 
satisfies T : P alt — > P aU and preserves interlacing. 

Proof. The proof is similar to the previous one. We check that 

T(xf) = xT(f + xf'-f) 

Now we know that 

f<f + xf'<f 
so f + xf'-f'<f 

By induction 

T(xf) =xT(f + xf -f )<T(f) 
and we apply Corollary 11 .461 again to complete the proof. □ 
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7.23 Biorthogonal polynomials 

In this section I would like to sketch a completely different approach to the 
problem of showing that a linear transformation preserves real roots. The 
main results are by Iserles and Norsett, and can be found in 1 90 1 . The idea is to 
begin with a parametrized measure, and to use it to associate to a polynomial 
a biorthogonal polynomial that is determined by the roots of the polynomial. 
There are a number of non-degeneracy conditions that must be met. In general 
the resulting transformation is highly non-linear, but there are quite a number 
of measures for which we get linearity. 

Choose a function w(x, a) and a measure d\x, and define a linear functional 



We start with a monic polynomial g (x) with all real roots X\, . . . , r n . Each root 
determines a linear functional £> n . We next try to construct a monic polyno- 
mial p(x) of degree n that satisfies 



This is a set of linear equations. There are n unknown coefficients, and n 
conditions, so as long as the determinant of the system is non-zero we can 
uniquely find p, which is called a biorthogonal polynomial. 

Next, we want to show that p has all real roots. The proof of this is similar 
to the proof that a single orthogonal polynomial has all real roots, and requires 
that the w(x, a) satisfy an interpolation condition. 

Finally, we define the transformation T(g) = p. It is quite surprising that 
there are any choices that make T linear. The coefficients of p are symmetric 
functions of the roots, and so can be expressed in terms of the coefficients of 
g, but in general there is no reason to suppose that there is linearity. 



f (x)cu(x, a) d^i 



£ ri (p)=0, £ r2 (p)=0, 



Chapter 




Affine Transformations of 
Polynomials 

In this chapter we investigate properties of polynomials associated to affine 
transformations. Recall that an affine transformation A satisfies A(x) = qx + b, 
and the action on polynomials is A(f (x)) = f (qx + b). In order not to burden 
the reader (and the writer) with lots of similar but different cases, we will 
generally confine ourselves to the affine transformations of the form A(x) = 
qx + b where q ^ 1. We are interested in those polynomials that interlace their 
affine transformations. In particular we will study the set of polynomials 

p A ={f e P f <Af}. 

8.1 Introduction 

We begin with general properties of these affine transformations for q > 1. 
A acts on the real line, and preserves order. Since q > 1 we see that A has 
a unique fixed point jz^- Consequently A is a bijection restricted to the two 
intervals 

A" =(-00,— — ) A+ = (— — ,oo) 

1-q 1-q 

Since A and A -1 preserve order, it is easy to see that A is increasing on A + and 
it is decreasing on A~ . More precisely, 

a < Aa iff <xe(j^,oo) 
Aa < a iff (-oo,^) 

The action of A on the roots of a polynomial is easy to determine. If f ( a) = 
then (Af)(A~ x a) = f(a) = so that the roots of Af are A -1 applied to the 
roots of f . Since A -1 preserves order it follows that f < — g implies Af < — Ag. 
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If f <§C Af and a is a root of f then we must have that A -1 a ^ oc, and hence 
a ^ Aa. We conclude that if f e P A then the roots of f must lie in {jz^,oo). 
For the three examples considered above we have 

A(x) = 2x f e P A => roots of f e (0, oo) 

A(x)=2x + 1 feP A => roots of f e (-1, oo) 
A(x)=x + 1 feP A => roots off e (-oo,oo ) 

Before we proceed, we should show that P A is non-empty. Define 

n-l 
i=0 

In case that A(x) = x + 1 we have 

(x^0>i = (x)(x + l)(x + 2)-"(x + n-l) = (x) n , (8.1.2) 
if A(x) = qx then 

(x;-l) A = (1 + x)(l + qx) ■ ■ ■ (1 + q n_1 x), (8.1.3) 
and again if A(x) = qx then 

(xjO>* = q(*)x n . (8.1.4) 

Since A _1 a < a for a e A + , if a e A + then the roots of ( x; a )^ in increasing 
order are 

A- n+1 (a) A- 2 (a), A-!(a),a 

and the roots of A( x; a ) A are 

A- n (a) A- 3 (a), A- 2 (a), A'V) 

Consequently if a e A + then (x; a) A <C A(x; a) A . 
Lemma 8.1. Iff £ P A then Af e P A . 

Proof. If f € P A then f <§; Af. Since A preserves interlacing Af <§C A 2 f and hence 
Af e P A . " ' - - n 

Polynomials in P A satisfy a hereditary property. 
Lemma 8.2. P A is an order ideal. That is, ifi g P A and g divides f then g e P A . 
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Proof. It suffices to show that if we remove one linear factor from f the result- 
ing polynomial g is still in P A . Since affine transformations preserve order, the 
effect of removing a root from f is to also remove the corresponding root in 
Af . The remaining roots interlace, and so g € P . □ 

The next lemma gives a useful criterion for determining if a polynomial is 
inP A . 

Lemma 8.3. The following are equivalent 

1. feP A 

2. There is a g such that f < — g and Af < — g. 

3. There is an h such that f < — h and f < — Ah. 

Proof. If (1) holds then f <c Af. Choosing g = Af shows that (1) implies (2), 
and taking h = f shows that (1) implies (3). If (2) holds and f (x) = Yl( x ~ r 
then A _1 Tt ^ so (1) follows from Lemma ll.4l If (3) holds then we know 
that f < — h and A _1 f < — h, so (2) implies that A~ x f <c f. Applying A to each 
side shows that (1) holds. □ 



8.2 The affine derivative 

We define the affine derivative D&f of f as follows: 

Af — f 

Definition 8.4. If f is a polynomial then Da^ = ■ 

Ax — x 

In case Af = f (x + 1) the affine derivative is the difference operator A(f ) = 
f (x + 1) - f (x). If Af = f (qx) then D A f is the q-derivative: 

f f(qx)-f(x) 

D„(f) = ■{ qx-x 1 X ^ (8.2.1) 
[f'(x) ifx = 

The affine derivative is well defined since q / 1 implies A(x) ^ x. It is 
immediate from the Taylor series that Da is continuous. If we let A approach 
the identity, then Da converges to the derivative. The degree of D A f is one 
less than the degree of f . As is to be expected for a derivative, D A x = 1, and 
there is a product rule 

Af-Ag-fg 

D A (fg) = 



Ax — x 

Af • Ag - Af • g + Af • g - f g 



Af • D A g + g D A f (8.2.2) 



Ax — x 
In particular we have 

D A (xf) = Af + x D A f = f + (Ax) D A f (8.2.3) 
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The formula for Da on the product of three terms is 

DaCWs) = D A f a • (f 2 f 3 ) + (Afi) • D A f 2 • (f 3 ) + A(fjf 2 ) ■ D A f 3 

and the general formula is 

n 

D A (fi ■■■f n )=Y_ A(fi • • • f k _i) • D A f k • (f k+1 • • • f n ) (8.2.4) 

The next lemma is the heart of the results about the affine derivative. 
Lemma 8.5. Iff e P A and a is positive then f — aAf S P A . 

Proof. We may suppose that f is monic of degree n, in which case the leading 
coefficient of Af is q n . Since f <|C Af we may write Af = q n f + h where f <\i 
so that 

f-oAf = (1- aq n )f- ah 
Writing f = q~ n Af — k where Af < k yields 

f-aAf = (q~ n - a)Af-k 

There are two cases, depending in the size of a. 

Case 1. If < a < q~ n then (1 — aq~ n ) and (q~ n — a) are positive, so by 
Corollary 11 ,30| we have 

f - aAf = (1 - aq~ n )f - ah < — f 
f - aAf = (q~ n - a)Af - k < — Af 



Case 2 If a > q n then (1 — aq n ) and (q n — a) are negative, so by 
Corollary 11 ,30| we have 

f-aAf = (1- aq~ n )f- ah — > f 
f - aAf = (q~ n - a)Af - k — ► Af 



In either case we can apply the affine criteria of Lemma 18.31 to conclude 
that f — aAf G P A . 

□ 

Remark 8.6. The proof of the lemma did not use an analysis of the roots. If we 
look at the roots then we get another proof of the lemma, as well as informa- 
tion about the behavior when a is positive. 

If the roots of f are r\ < ■ ■ ■ < r n then we know that 

A^Ti < Ti < A~ : r 2 < r 2 < ■ • ■ < A _1 r n < r n . 
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If a is negative then the roots {st} of f + aAf lie in the intervals 

(-00, A _1 Ti), (n, A _1 r 2 ) / • • • , (r n _i, A _1 r n ) 

and consequently si < A -1 ^ < Ti ^ st+i. This shows that Ast < Si+i, and 
hence f + ccAf e P A . 

If a is positive the roots lie in the intervals 

(A-^^n), {k- l r 2 ,T 2 ), ■■■ , (A^r^rn) 

The roots {st} satisfy St ^ Ti ^ A _1 Ti + i sC St+i. We can not conclude (indeed 
it's not true) that Ast ^ Si+i from these inequalities. 

If we make stronger assumptions about the roots of f then we can draw 
conclusions about f + ccAf . Suppose that f <C A 2 f . This means that the roots 
of f satisfy A 2 rt < The roots st now satisfy st ^ ri,A~ 2 rj. + i ^ st+i. 

Since the interlacing hypothesis implies that Art ?C A _1 Ti+i we conclude that 
Asi ^ Si+i. A similar argument establishes the following result: 

Lemma 8.7. If ~k is a positive integer greater than 1, f <C A k f and a is positive then 
g = f + aAf satisfies g <C A k ~ : g. 

The affine derivative has all the expected properties. 

Lemma 8.8. Suppose f e P A - 

1. D f /' . 

2. f < D A f. 

3. Af < D A f 

4. J/f,ge P A satisfying then D A f < D A g. 

Proof. Lemma 18.51 implies that f — Af € P , so the first result follows from 
Lemma I&21 Since f <|C f — Af and the smallest root of f — Af is the fixed point, 
we can divide out by x — Ax and conclude from Lemma r8.2| that f < DAf. The 
third part is similar. 

In order to show that Da preserves interlacing it suffices to show that 
Da(x + a)f < DAf where (x + a)f G P A . From the product rule (|8.2.3b we find 

D A (x+a)f = (x+a) D A f + Af 

Since (x + a) DAf < DAf and Af < DAf we can add these interlacings to con- 
clude D A (x + a)f< D A f. □ 

P A is closed under ordinary differentiation. 

Lemma 8.9. Iff e P A then f e P A 

Proof. Since f e P A we know that f + aAf e P for all a. Since ^Af = q A(f') 
we have that f + aqAf ' e P for all a and so f e P A . □ 
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Only in certain special cases do A and Da commute. 

Lemma 8.10. For any positive integer n we have that q n A n D q (f ) = D q (A n f ). 
Consequently, if A Da = DaA then Ax = x + b. 

Proof. First note that 

^x — x) — A(qx + b — x) — (q — l)(qx+ b) +b = q(qx + b — x) = q(Ax — x). 



This implies that for all positive n we have A 11 - (Ax — x) = q n (Ax — x), and 
hence A~ n (Ax — x) = q~ n (Ax — x). We can now compute: 

r, A n+1 f-A n f 
D q (A n f) = 



Ax — x 

a Af-f 

A- n (Ax-x) 
q n A n D q f 



A 



□ 

We next consider properties of f(A)g and f( Da)q. We first observe this 
immediate consequence of Lemma 1831 

Lemma 8.11. I/ge P alt and f € P A then g(A)f e P A . 

Corollary 8.12. If q g P pos (r) andi < A r+1 f then g(A)f G P A . 
Iff < A r fen £5 (k)A k f G P. 

Proof. Factor g, and apply Lemma inductively For the second part, apply the 
first part to g = (1 + x) r . □ 

Next we have an additive closure property of the affine derivative. 

Lemma 8.13. If g G P A and a is positive then g - a Dig e P A . 

Proof. We know that g < D A g so g — a D A g <C fl. Also, g Ag < D A g, so g — 
aDigCAg. Now apply Lemma [831 □ 

Lemma 8.14. I/ge P alt and f G P A fften g( D A )f e P*. 

Proof. Write g(x) = (x + ai) . . . (x + a n ) where all at are negative. Since 

g( D A )f = ( D A + Qi)...( D A + a n )f 
we apply Lemma [8. 1 31 inductively. □ 
Here's a simple property about determinants. 
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Lemma 8.15. Suppose f, Af e P A n P pos . 



If 



{Ax > xfor all positive x 
Ax < x/or all positive x 



f Af 




Af A 


v 2 f 


•f 


D A f 













The same inequalities hold for 

"A > "A 1 

Proof. We begin with the first statement. We need to show that f • A 2 f < Af • Af 
for positive x. Since Af is in p pos , Af is positive for positive x, so it suffices to 
show that (Af/A 2 f) > (f/Af). 

Since f e p se P we know that f <fC Af, and so there is a positive constant c 
and non-negative at such that 

Af = cf + > ai 

where the roots of f are {r-j.}. Thus 



Af v— Qi 

T - c + ^xTT- 

AfV v- 1 



and consequently Af/f is decreasing. Since Ax > x, it follows that 

Af , , 



f f 
-r-Ax) < 4-A 



A 2 f* 



If Ax < x then the inequality is reversed. The last statement follows from 
the above since 



f Af 


1 


f D A f 


Af A 2 f 


~ (Ax - x) 2 


D A f D 2 f 



□ 

Remark 8.16. ( x; q ) A in <8.1.1> is uniquely determined by the three conditions 
below. 

(1) (x^a)o = 1 

(2) (a^a)* = 

(3) D A (x2a)* = [n]A(x£a)i_j 

This characterization of (x; a) is motivated a similar result by l98l in the 
case Ax = qx. 
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8.3 Sign interlacing for P 

Let's look at some graphs in order to better understand P A . Suppose that 
f <C Af. The graph of Af is the graph of f, with some shifting to the left and 
scaling. For instance, if Ax = x + 1 and f = x(x — l)(x — 3)(x — 5) then the 
graph of f and Af is in Figure [83] 




Figure 8.1: Understanding A 

If f <? g and Af < g then the roots of g must lie in the thickened segments 
of Figure [531 It's clear that the roots of g in the figure differ by at least one, as 
is guaranteed by Lemma [8731 

If we translate the qualitative aspects of this graph, we get a version of sign 
interlacing. Assume that the roots of f are 

n < r 2 < • • • < r n (8.3.1) 

The roots of Af are 

A _1 ri < A _1 r 2 < • • • < A _1 r n (8.3.2) 
Since f Af we can combine these orderings 

A _1 ri < n < A _1 r 2 < t 2 < ■ • • < A _1 r n < r n (8.3.3) 

If f < g and Af < g all have positive leading coefficients then it is clear from 
the graph that g(r n ) > 0, gfA^Tn) > 0, g(r n _i < 0, g^V-i) < 0, and 
generally 

(-l) n+i g(n) >0 (-lr+^A-S) > (8.3.4) 
This is sign interlacing for A. We clearly have 

Lemma 8.17. If g is a polynomial of degree n — 1 that satisfies ([8. 3.41 then g G P A . 

The following polynomials play the role of f (x)/(x — rt) in Lemma ["1.201 
For 1 ^ k ^ n define 

f M = A[(x - n) • • • (x - r k _ a )] • [(x - r k+1 ) ■ ■ • (x - r n )] (8.3.5) 
These are polynomials of degree n — 1 whose roots are 

A _1 ri < A _1 r 2 • • • A _1 r k _i < r k+ i < • • • < r n (8.3.6) 
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Since the v\ are all distinct, it is easy to see that the f [ k ] are independent. 
Thus, they form a basis for the polynomials of degree n — 1. 

Lemma 8.18. Suppose that f e P A and g = Y. a k f [k] JPft^e a ^ it <ws non-negative. 
Then g <G P A and f < g and Af < g. 

Proof. The key point is that f <f[k] and Af <f[k] - this follows from (18.3.1b , 
<|8.3.2t , <[8.3.3b , l|8.3.5b . If all the at are non-negative then we can add these 
interlacings to finish the proof. □ 

The fact that f < D A f follows from Lemma EES From j8lT4t we see that if 

f(x) = (x — T\) ■ ■ ■ (x — r n ) then 



D A (f) = D A (x-Ti)-..(x-r n ) 

Tl 

= Y A ( (x-ri)>- - (x-T k _i) ) • 1 ■ ( (x-Tk+x) ■••(x-r u )] 

k=l 

= f [1] + h f [n] 

Consequently, this is another proof that f < DAf. 

8.4 The affine exponential 

We can formally construct an affine exponential Ea function, but the series 
might not converge. We want two simple conditions 

(1) £a(0) = 1 

(2) D A (E A )=AE A 

The relation DaEa = AEa determines Ea up to a constant factor that is 
settled by assuming E A (0) = 1. If we write 

E A (x) — 1 + aix + d2X 2 H 

and apply DaEa = AEa we find 

an-iA(x n -') = a n D A (x n ) 
and recursively solving for the coefficients yields 

00 n A k — 1 Y 

n=0k=l Wa 

where 

[k] A = (A k x - x)/(Ax - x) = [k] = q^ 1 + q^ 2 + • ■ ■ + q + 1. 
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and 

A k x = q k x + b[k+ 1] 

Now this derivation is not valid if b ^ 0, but we still can use this definition 
since we have that 

,k-i x ™-i Ak-l, 



D An^r-= A n 



, , Ma t\ Ma 

k=l k=l 

and thus term by term differentiation establishes condition (2). In particular 



Ax = x E-aM = —j (8.4.1) 



n=0 



Ax = qx E A (x) = Y_ ^"* n (8-4.2) 

Ax-,(x + l) E, M -f m^l±m (8.4.3) 

n=0 

Ax = x + 1 E A (x) = ^^ (8.4.4) 

n=0 n ' 

In the first case Ea is the usual exponential. The second one is the q- 
exponential of 115.1. Il l and has a representation as an infinite product. The 
last two have product representations for their finite sums: 



Ax = qx Zw^nC 1 -^ 1 -^*) ( 8A5 ) 

n=0 k=0 

Ax=q(x + 1) zfnTl=f[f 1 + T) 



Ax 



n=0 \k=l / k=l N 



n=0 k=l 



Unlike the partial sums of the exponential function, the partial sums of 
118.4.41 and l|8.4.3l l have all real roots, as the product representation shows. 
Since the sum ^ q k_1 /M converges for |q| < 1 and diverges for |q| > 1, it 
follows that the product (|8. 4.61 1 converges for | q | < 1 and diverges for q ^ 1 . 
Consequently, equation (|8. 4.31 1 is only valid for |q| < 1. The series in d8.4.4l > 
does not converge. 

The partial sums in the last two cases satisfy the exponential condition (1) 
and nearly satisfy (2). Let Ea,n denote the partial sums in (|8. 4.61 1 and (18.4.7b . 
Then 
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Ax = q(x + 1) D a (E a ,n) = AE a ,n-i 

Ax = x + 1 A(E AjN ) = AE A/N _i 

8.5 A-interlacing and the symmetric derivative 

We say that f and g A-interlace if f + ag G P A for all a € R. If the degree of 
f is greater than the degree of g and f and g A-interlace then we write f <J A g. 
We give a criterion for affine interlacing, and use it to determine when the 
symmetric affine derivative is in P A . 

Lemma 8.19. J/f eP 4 and f < A 2 f then f and Af A-interlace. 

Proof. From the interlacings f <|C Af, f <|C A 2 f, Af <|C A 2 f we conclude that if a > 

o 

f <f + oAf 
f <Af + aA 2 f 

and if a < 

f >f + aAf 
f >Af + aA 2 f 

Thus f + aAf and A(f + aAf) always have a common interlacing, so we find 
that f + aAf e P A . ' □ 

Here is an example of polynomial Q that A-interlaces AQ . If A is increasing 
on R then define 

Q = (x + l)(x + A 2 l)(x + A 4 l)---(x + A 2n l) 
and if A is decreasing define 

Q = (x-l)(x-A 2 l)(x-A 4 l)---(x-A 2n l) 

If we apply A 2 to Q we have even powers from A 2 to A 2n+2 , so clearly 
Q<A 2 Q. 

Here is a variation on the lemma for two polynomials. 
Lemma 8.20. Iff < g and f g e P A then f < A g. 
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Proof. Suppose that the roots of f(x) are fj < • • • and the roots of g(x) are 
gi < • • • . The hypotheses imply that the roots appear in the order 

■ • • g t ^ Ag t < fi ^ Afj < g i+1 ^ Ag i+ i < Af x • • • 

from which it follows that f <C Ag. For positive a (first column) and negative 
a (second column) we have 

f<f+ag f>f+ag 
f«Af + aAg f>Af+aAg 



Consequently, f + Ag £ P A . □ 

Clearly f < A g implies that f < g but < A is a more restrictive condition 
than < , and the converse does not usually hold. We say that f < A g if f <. A g, 
and f and g have no common factors. Using sign interlacing we can replace 
< and < A in the lemma by < and < A . There is a simple criterion for such 
interlacing. 



Lemma 8.21. Suppose that f < g. Then f < A g if and only if 



f g 



is never zero. 



Proof. From Lemma [l. 521 we have that f + ag <C Af + aAg for all a € K. This 
means that f + ag € P A so the conclusion follows. □ 

The symmetric derivative is 



sun,,,, A(f)-A-!(f) 



For Ax = x + 1 this is the symmetric difference 

D 



s-ym _ f[x + l) -f(x- 



and for Ax = qx it is 



« SS m,„ f(qx)-f(x/q) 



1/q x 



Lemma 8.22. If f £ P A then D* um f £ P. If in addition we have that f < A 2 f then 

D sVm fepA 

Proof. Since f £ P A we know that f -C Af and consequently f — Af £ P, so 
D^ m f £ P. Next, assume that f < A 2 f. We have the interlacings 

A _1 f <f <Af 
A _1 f <A~ J f <Af 
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which imply that 

f <Af- A _1 f 
A~ J f <Af- A~ : f 

The last two interlacings imply Af — A _1 f G P A , and so the conclusion follows. 

□ 

Here's an example that shows that the symmetric derivative is not in P A if 
the second condition is not satisfied. If Ax — x + 1 then 



Dl vm (%L = n(x--^)(x-l) 



n-1, 

n-2- 



If n is odd then the symmetric derivative has a double root, and so is not in 
P sep . 

8.6 Linear transformations 

In this section we investigate transformations that map some subset of P to P A . 
The key idea is the iteration of the linear transformation f i— > (x— ct)A _1 f — |3f. 

Lemma 8.23. The linear transformation S: f i— > (x — ct)A _1 f — (3 f where |3 ^ 
maps 

S' p a n p'"' 00 ' > p A n p' 01 ' 00 ' 

Proof. Since f e P A we know 

A _1 f <f 

and using f e pf"- 00 ' we get 

(x-a)A -1 f <A -1 f. (8.6.1) 

(x-a)A _1 f^f. (8.6.2) 

By hypothesis (3 ^ 0, so we can apply Corollary ll.30l to d8.6.2|) and Lemma H.311 
to (|8.6- 1|) yielding 

(x- a)A" x f- |3f <f 
(x-a)A -1 f- pf <A -1 f 

which shows that S(f ) G P A . Since A _1 f G p( a '°°» we can apply Corollary[L30] 
to 

(x - a)A _1 f - |3f < (x - a)A _1 f 
and conclude that S(f) G P< a '°°>. □ 



CHAPTER 8. AFFINE TRANSFORMATIONS OF POLYNOMIALS 



226 



Now, define polynomials by Q"' 13 (x) = ( (x — oc)A _1 — |3 ) n (1). Some par- 
ticular examples are given in Table 18.11 They are immediate from their defi- 
nitions, except for the Charlier polynomials, where we apply the recurrence 



Ax 


OL 


(3 


Qn 15 


x + 1 










x + 1 





P 


c£(x) 


qx 








q-(?) x n 


qx 


-1 








Table 8.1: Polynomials of the form Q"' 



Proposition 8.24. The mapT: x n i— > Q" (x) z's a //near transformation 

p( — P,oo) > p(a,oo) p| pA 



Proof. If we define S(f ) = (x — a)A 1 f — (3 f , then from the definition of T we 
have that T(x n ) = S n (l) = Q£' p , and so by linearity T(f) = f(S)(l). If we 
choose f = Yli x ~ r t) m P' _|3,00 ' then each root r\ satisfies i*i ^ — 13. Notice 
that 

T(f) = Y\(S - n) = Y\((x - a)A- : - |3 - n)(l) 
Since (3 +n > 0, if follows from the lemma that T(f ) € P'"' 00 ' n P A . □ 

Here is a general criterion to tell if a transformation preserves P A . See 
Lemma [8361 for an application. 

Lemma 8.25. IfT:P — > P preserves interlacing, and TA = AT, then T: P A — ► 
P A . 

Proof. Assume that f <f; Af. Since T preserves interlacing we have Tf <c TAf . 
Commutativity implies that TAf = ATf, and so T(f ) <|C AT(f ), and hence 
T(f ) e P A - □ 

8.7 The case Ax = x + 1 

In this section we study the affine operator 1 Ax = x + 1. The space of all 
polynomials f satisfying f (x) <C f (x + 1) is denoted P sep , where sep stands for 
separated. Here are some necessary and sufficient conditions for a polynomial 
f to belong to P sep : 



This operator is often denoted by E. 
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1. f(x)<f(x + l) 

2. The distance between any two roots of f is at least 1. 

3. f (x) and f (x + 1) have a common interlacing. 

4. For all positive |3 we have f (x) + |3f (x + 1) € P. 

For example, both (x) n and (x) n are in P sep . The affine derivative is the 
difference operator A(f) = f(x + 1) — f (x). The difference operator commutes 
with A. Since the derivative maps P sep to itself, the roots of the derivative 
can't get too close: 

Lemma 8.26. If the distance between any two roots of a polynomial f in P is at 
least 1, then the distance between any two roots of the derivative f is also at least 1. 
Similarly, the distance between any two roots off [x + 1) — f (x) is at least one. 

Proof. We know that f e P sep implies that both f ' and Af are also in P sep . □ 

If we express a polynomial in the basis (x) n , and use the fact that A(x) TL = 
n (xJn-i we conclude 

If Y_ nas roots that are separated by at least one, then the 

same is true for £j iat(x) i _ 1 . 

From Lemma [8T51 we find that differences preserve interlacing: 

Lemma 8.27. J/f, g e P sep satisfy f < g then Af < Ag. 



Corollary 8.28. If g e P alt and f € P sep then g(A)f e P sep where A(f(x)) = 
f(x + 1). Similarly, ifge P alt then g(A)f e P sep . 



The second part of this corollary does not hold for all f . For instance, if 
f = x(x + 1) then f — aAf = (x + 1) (x — 2a), and the distance between roots is 
1 + 2a\. Thus f — aAf G P sep if and only if a ^ (—1,0). This is true generally. 

Lemma 8.29. Ifi e P 1 ^'- 1 ' ' then f(A) : P sep — -> P sep . 

Proof. It suffices to show that f - aAf e P sep if f € P sep and a g (-1, 0). Note 
that 



Since a ^ (—1,0) implies > Q we can apply Corollary 18.501 to finish the 



Lemma 8.30. Suppose that d z's a positive integer. The linear transformation 
T: x k i—* ( x + d - k ) d maps P alt to P sep . 



Proof. Use Lemma [8.14l for the second part. 



□ 




proof. 



□ 



Here is a nice consequence of composition. 
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Proof. Let Bx = x — 1. The important observation is that B k ( x + d ) d = T(x k ). 
Thus, T(f ) = f (B) ( x + d ) d . Thus we only have to verify the conclusion for one 
factor. Since f S P alt , we have to show that (B — a) maps P sep to itself if a is 
positive. This follows from Lemma [831 □ 

Simple geometry forces a relationship between Af and f. 

Lemma 8.31. Ifi € P sep then f < Af. 

Proof. Consider Figure 18.21 The points a, b are consecutive zeros of f, and 
satisfy b — a ^ 1. Points d, e are on f, and the horizontal line between them 
has length exactly 1. Finally c is a root of f. 

The x-coordinate of d is a root of f (x + 1) — f (x) = 0, and so is a root of Af. 
Since the x coordinate of d is obviously less than c, it follows that f ' <C Af . 

f 




Figure 8.2: The difference and the derivative 



□ 

Remark 8.32. If f e P sep then we know that f < f, f < Af, f e P sep , and Af e 
P sep . Even though f < f + Af, it is not necessarily the case that f + Afe P sep . 
A simple counterexample is given by f = x(x — l)(x — 2). 

An important class of polynomials in P sep are the Charlier polynomials 
C"(x). A simple consequence of the following result is that the Charlier poly- 
nomials are in P sep . 

Corollary 8.33. Assume a < 0. 

2. The map x l i-> maps P alt — ► P alt n P sep . 

2. The map x l ^ (x) n maps P alt — ► P alt n P sep . 

3. The map x i ^ (x) n maps P pos — > P pos n P sep . 

Proof. Apply Proposition l8.24l to Table [8T] Switch signs for the rising factorial 
in the result for the falling factorial. □ 

Expressing this result in terms of coefficients yields 

If Q i x1 ^ nas a ll negative roots, then Y. a i(^)i has roots whose 
mutual distances are all at least one. 
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The following are consequences of Corollary l8.12l 

Corollary 8.34. If g e P pos (n — 1) and all roots of f are at least n apart then 
g(A)feP sep . 

Corollary 8.35. If the roots ofi are separated by at least n+1 and f e P sep (n) fen 

£ Wf(n + i)eP 



i=0 



I/f e P sep fen 



i=0 



IH) 4 J f(n + i)6P 



Proof. Apply the previous corollary to (1 + A) n f and (A — l) n f. □ 

The next result is an interesting special case of Lemma [8. 141 
Corollary 8.36. Ifg e P sep and m. is a positive integer then 



i=0 



Proof. Apply Lemma [8.14l withf fx) = x m . □ 
If we apply this result to (x) n we get 



sep 



i=0 

The following corollary of Lemma l8.15l is a slight strengthening of Corol- 
lary l73l 

Lemma 8.37. Suppose f € P sep n P pos . For positive x 

f(x+l) f(x) n 
f(x+2) f(x+l) > U 

Remark 8.38. This does not necessarily hold for all x e K. If f = x(x + 1) 
then the determinant is 2(x + l)(x + 2) which is negative for x € (—2,-1). 
However, if f = x(x + 2) then the determinant is positive for all x. More 
generally, it appears that if f(x)<Cf(x + 2) then the determinant is always 
positive. However, there are examples, e.g. x(x + 2)(x + 3.5), where f (x) and 
f (x + 2) don't interlace, yet the determinant is always positive. 

Example 8.39. Here is a family of polynomials whose definition is similar 
to the Rodrigues definition of the Legendre polynomials (See Ouestion ll31ll . 
Define polynomials by the Rodrigues type formula 



It 

i=l 



x 2 -i 2 ) 
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Since nr=i(x 2 — * s i R P sep , the polynomial p n is in p se ?, and has degree 
n. Moreover, we claim that p n +i < Pn- Since A preserves interlacing in P sep it 
suffices to show that 

AnV-i 2 )<fi(x 2 -i 2 ) 

V=l 1=1 

Now the roots of the left hand side are seen to be 

-n, — n + 1, • • • , -2, -1, -.5, 1, 2, • • • , n 

and the roots of the right side are 

-n, -n + 1, • • • ,-2,-1,1,2,- • • ,n 

and so they interlace. The p n 's are not orthogonal polynomials since they 
do not satisfy a three term recurrence. However, the p n appear to satisfy a 
recurrence involving only three terms: 



Pn+2 = (n + 2) 2 p n+ i + x(2u + 3)pn+l + 

(n + l)(n + 2) Apn+i +x(2n + 3) Apn+i -2(2n + 3)(n + l) 3 Ap n 

If we write a polynomial in the interpolation basis, then there is a simple 
estimate on the separation of roots. One consequence is that if 5(f) > then 
6(f) > 6(f). 

Lemma 8.40. Suppose that f = (x — aj) . . . (x — a n ) where aj < • • • < a n and that 



g(x) =bf(x)+^bt 




where b ^ 0. Z/bi ^ b 2 ^ • • • ^ b n > then 6(g) > 6(f). 

Proof. Choose consecutive roots <ij_i, cij, Oj+i, and points Xj,x 2 satisfying 

Qj_i < X! < aj < x 2 < Qj+i- We will show that — (x 2 ) — t( x i) > if 

|xi — x 2 | < 6(f). It follows that for |xi — x 2 | < 6(f), xi and x 2 can not be 
adjacent roots of g, so 6(g) > 6(f ). 
We calculate 

f f f— ■ x 2 - di f— ■ Xl - di 

1=1 1=1 

_ bi b n y- 1 / b i+1 bj \ 

x 2 - Qi a n - xi f-^ \x 2 - Qi+i Xi - ax) 
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Now since bi ^ bt+x, we have 

^ h h > b i+ i 

x 2 - Qi On - xi f-j \ x 2 - a i+ i X! - at 

bi b n "i- 1 (ok+i — aO — (xa— xx) 

= 1 H > bi+i— : T7 r- 

x 2 -ax a n -xj (x 2 - a i+ xJ(xi - a t J 

The first two terms are positive; the numerator is positive since x 2 — Xi ^5 6^ 
Oi+x — Qi, and the denominator is positive since Xx and x 2 lie in consecutive 
intervals determined by the roots of f . □ 



8.8 A multiplication in P sep 

The product of two polynomials in p sep is not in P sep . We can remedy this 
situation by using a different multiplication. We define the bilinear transfor- 
mation f <g> g on a basis: 



O is a diamond product - see § 1616.111 A and ® interact as expected. 
Lemma 8.41. A(f ® g) = Af ® g + f <g> Ag 
Proof. We only need to check it on a basis. 

A((x) n ® (xj J = A(x) n+m = (n + m)(x) n+ m-i 



□ 



It follows by induction that 

n 

A(fx® ■ ■ • ® f tO = ^~ f x 8 • ■ ■ ® Af j ® ■ • ■ ® f n 

o 

and in particular the difference of a product of linear terms is 

n ^ ^ 

A((x — ax) ® • • • ® (x— a n )) = ^~ (x— ax) ® • • • ® (x — at) ® • • • ® (x — a u ) 

o 

where the hat means "omit". 

Every polynomial in P pos can be factored in terms of ® . 

Lemma 8.42. Iff e P pos (n) zs monk then there are positive ax, . . ., a n sizc/z f/zat 
f (x) = (x + ax) ® (x + a 2 ) ® • • • ® (x + a n ). 
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Proof. This follows from Proposition 16.561 since the linear transformation 
T: (x) n * n maps P pos to itself. 

□ 

Remark 8.43. Here's an example of the Lemma: 

(x + 3)(x + 4) = (x + 2) <g> (x + 6) 

The Lemma shows that the product of certain terms with positive coefficients 
is in P; there are products of linear terms with positive coefficients that are not 
inP: 

(x + 1) <S> (x + 2) = (x + 1 - i)(x + 1 + i) 

If f € P alt then there might be no such representation. For example, the 
following is a unique representation: 

(x - 1 - 1) ® (x - 1 + 1) = (x - l)(x - 2) 

Lemma 8.44. J/feP sep n P alt and (3 ^ then 

(x- (3)®f G P sep nP alt . 

Proof. Let T(f) = (x - (3) <g> f. Since 

T((x) n ) = (x) n+1 - P(x) n = (xA- 1 - p)(x) n 

weseethatT(f) = (xA^-pjf. Lemma[823limplies that Tf f 1 G p se P n p alt . □ 

By iterating the Lemma, we get: 
Corollary 8.45. If a.\, . . . , a n are positive then 

(x- ai) <g> (x- a 2 ) ® • • • O (x - a n ) G P sep n P alt . 
We have seen that the following transformation preserved P alt . 
Corollary 8.46. The transformation T : (x) n i— > (x) n+ i w^aps 

psep p. palt ^ psep p palt 

Proof. Since T(f ) = x ® f we can apply Lemma [8.441 □ 

We would like to show that ® preserves P sep n P alt . See Question l90l 
This follows easily from Question [76j that the map x n — > (x) n is a bijection 
between P pos and P sep n P pos . 
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8.9 The analytic closure of P sep 

The analytic closure P sep is the uniform closure on compact subsets of P sep . 
Obviously P sep C P. Since P sep is closed under differentiation and difference, 
so is Y 1 ^. 

Lemma 8.47. The following functions are in p se V; 

1. sin TOC, COS 7TX. 

2. e ax 

3. e ax f(x)/oranyf e P sep . 
Proo/. From the infinite product 

e 
k 2 



Sin 7TX = 7TX 

k=l 



we see that the polynomials 



7TX 

k=l 



converge to sin roc and are in P sep since the roots are — n, — n + 1, . . . , n — 1, n. 
A similar argument applies to cos roc. 

Next, we show that for a > we can find polynomials p n such that 

1. Pn G P sep 

2. For every t there is an N such that n > N implies that the roots of p n 
have absolute value at least r. 

3. limpn = e ax 

We start with the identity, valid for positive |3: 

(P + l)*= lim fl(l + -J?-) (8.9.1) 

TW °° k-l ^ ^ ' 

Let p n be the product. The roots of p n are — (n/(3) — n, . . . , — (n/(3) — 1 so 
p n € P sep . This shows that (1) and (2) are satisfied. If we choose positive (3 

such that |3 + 1 = e a thenp n also satisfies (3). Thus, e 1 ** e P sep for positive 
a. Since g(x) e P sep if and only if g(-x) G P sep , it follows that e 1 ** e F 7 ^ for 
all a. 
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If f e P sep then we can find an N such that n > N implies that p n (x)f (x) G 

P sep . Taking limits show that e ax f (x) G F 7 ^ . 

Finally, if e -011 belongs to P sep , then so do all its derivatives. The Ro- 
drigues' formula for Hermite polynomials (|7. 8.1| | implies that the Hermite 
polynomials H n (x/y/oi) would be in P sep . This means that all roots of H n (x) 
would be separated by at least ^/oi. However, from [168J(6.31.21) we see that 
the minimum distance between roots of H n goes to zero as n goes to infinity. 

Alternatively, if e""' G P 1 ^ and f G P sep then differentiating e~ ax2 f and 
dividing out by the exponential factor would imply that f ' — 2ocxf G P sep . But 
if f = x(x + l)(x + 2)then6(f'-2xf) =.95. ' □ 

The following consequence is due to Riesz [165]. 

Corollary 8.48. Iff G P sep and <x G K ffren af + f ' G P sep . 

Proof. Since e ax f (x) G P sep we differentiate and find that (af + f ' ) e ax G P sep . 
The conclusion follows. □ 

Corollary 8.49. Ifg(x) ePandfe P sep then g(D)f G P sep . 

The next result is not an independent proof of Lemma 18.51 since that 
Lemma was used to show that p sep is closed under differences. 

Corollary 8.50. Iff G P sep and (3 > then |3 f (x + 1) - f (x) G P sep . 

Proo/ Note that A(e ax f) = e ax (e a f (x + 1) - f (x)). □ 

As long as function in P has no factor of e~ ax2 , and all roots are at least 
one apart then f is in P sep . 

Lemma 8.51. Assume that a n+ i — a n ^ lfor n = 1, 2, 

1. Iff (x) = e ax n" a (l-^)eP tfien f G P 7 ^. 

2. If f(x) =e ax nr=i [(l-^) e- x/ 

3. 1/F(x) G P^ 5 . 



G P tafe P sep . 



Proof. The assumptions in (1) and (2) guarantee that the product converges. 
Part (1) follows from Lemma I8.47r 3). From the proof of this lemma we let 
p(a,r) g P sep be a polynomial whose roots have absolute value at least |rj, 
and limr^oopfc^r) = e ax . Consider the polynomial 



x) =p(a,r ) Yl ( (1- — Jp(-l/ai,Ti 
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We can choose the X\ sequentially so that q n <E P sep . Taking limits yields (2). 
The Weierstrass infinite product representation 

- oo 

TW-» T -n[(i + ;)«^"] 

shows that e P . Since the roots are at negative integers, part (2) implies 
that l/r(x) e P 7 ^. □ 

Since is in P sep , so are all the differences A n p(^j. We can find a recur- 
sive formula for these differences. If we write 

AV l Mx) 



then 



r(x) (x) n r(x) 



A n+1 1 f n (x + l) f n (x 



r(x) (x + i) n r(x + i) (x) n r(x) 

= f n (x + l)-(x + n)f n (x) 

(x)n+i r M 

and consequently the numerators satisfy the recurrence 

fn+l(x) =fn(x + l)-(x + n)f n (x) 

Since f i (x) = 1 — x, the next lemma shows that all of the numerators are in 

psep 

Lemma 8.52. Suppose that g\ (x) e P 10 ' 00 ' n P sep and the sequence {g n } is defined 
by the recurrence 

9n+i(x) = g n (x + 1) - (x + n)g n (x) 

Proo/. The function — — —p^— r is in P sep , as can be seen by removing ini- 
tial factors from the infinite product; its zeros are — n — 1, — n — 2, • • • . Since 

□ 



A n -^— = , , ^ n+ * , is in p sep , the roots of g n +i must lie in f— n,oo). 

r(x) (x) n+a r(x) 



If we consider f/r we get 
Lemma 8.53. Iff e P 11 ' 00 ' n P sep f/zen f(x + 1) - xf (x) e P 10 ' 00 ' n P sep . 
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Proof. The largest root of 1/F(x) is 0, so f(x)/T(x) G P sep . Taking differences 
yields 

f(x + l) f(x) _ f(x + l)-xf(x) 

r(x + i) r(x)' xr(x) 

and the numerator is in P sep . The largest zero of xF(x) is —1, so the conclusion 
follows. □ 

Here's a result that we get from using (x) n in place of fhr- 

Lemma 8.54. J/f G P sev n P ln '°° ] then nf + (x + l)Af G P sep n p^" 1 - 00 ). 

Proof. Since the largest root of (x) n is n — 1 it follows that (x) n f G P sep . Now 
taking the difference preserves P sep , so 

A(x) n f = fA(x) n + (Af)A(x) n 

= (x) n _! (nf +(x+l)Af) 

is in P sep . The result follows since factors of polynomials in P sep are in P sep . 

□ 

Motivated by the analogy of (x) n /n! and the exponential, we define a La- 
guerre like polynomial, and show that consecutive ones interlace. 

Lemma 8.55. //f n (x) = A n ( x + 1 )„ ( x - 1 L then f n G P sep andi ri <i n - 1 . 

Proof. If we letp n (x) = (x + 1) ^ ( x — 1 )^ then 

A Pn (x) = n(x I h2> n _ 1 (x_l) n _ 1 (l +2x). 

Observe that that Ap n ^p n _i. Since p n e p se P the conclusion follows by 
applying A n_1 . □ 

Lemma 8.56. J/T(f ) = f (x + 1) + f (x - 1) then T: P sep — ► P sep . tl80l 

Proof. Observe that T(f) = 2cos(D)f, so that T maps P — > P and preserves 
interlacing. Since T commutes with A, the result follows from Lemma 18.251 

□ 

8.10 Integrating families of polynomials in P sep and P A 

We now apply the integration results of § 1313.61 to P sep . Since polynomials in 
P sep are exactly those polynomials whose roots are separated by at least one, 
f G P sep iff 6(f) ^ 1. We will see more applications of integration when we 
consider the generalization of P sep to two variables. 

One useful way of constructing locally interlacing families is to start with 
f (x) G P sep , and form the family {f (x + t)}. The interval of interlacing is at 
least one. 

Here is a simple application. Note that the hypothesis f ' G P sep doesn't 
imply f G P, yet we can conclude that Af G P. 
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Corollary 8.57. Suppose that w(x) is positive on (0,1). The mapping 

pi 



f 



f(x + t)w(t) dt 



(8.10.1) 



o 



defines a linear transformation P sep — ► P. 

Corollary 8.58. Iff G P sep f/zen Af e P where Af = f (x + 1) - f (x). 

Proof. Choose w(t) = 1, a = 0, b = 1, and f t (x) = f '(x + t) in Proposition l3.37l 

□ 

Remark 8.59. Integration is an example of a linear transformation 
T: P sep — > P that does not preserve interlacing. For example 



f(x) =x(x + l) 

pi 



T(f) = 



f(x + t) dt 



9W 



T(g) = 



o 



g(x + t) dt 



.5 



= (x+.59)(x + 1.4) 

Thus, f < g yet Tf and Tg don't interlace. 

The following result is convolution of polynomials in P sep . 

Lemma 8.60. Iff, g € P sep then 

pi 

f(x + t)g(x-t) dt € P 

Proof. If we choose < ti < ••• < t n then let fi(x) = f (x+ti) and g n +i-i(x) = 
g(x — ti). By Lemma 13.161 we know that Y. fi 9n.+l— i € P- Taking the limit 
shows the integral is in P. 

Note: the diagram in the proof of Lemma 13.161 is essentially the graph of 
f (x + t)g(x — t). Consideration of this diagram shows that we can find f, g so 
that if h is their convolution then 5(H) is as small as desired. □ 

Since we only need separation in one variable, we phrase the next result in 
terms of mutually interlacing families. 

Corollary 8.61. if {ft} and {g t } are locally interlacing families on K with p(f ) ^ 1 
and p(g) ^ 1, and we define 



cp = 



l-t 



ft(x)g s (x) ds dt 



then O e P. 



Proof. If we define h t = J g s (x) dt then by Lemma [3.43l iH + } is a locally in- 
terlacing family with p(h) 1. Since O = f t (x)hi_ t (x) dt we see that O 

is a convolution of mutually interlacing polynomials. (O is generally not in 
psep) ' □ 
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Corollary 8.62. Iff G P sep then 



f(x + t)e _t dt G P. 



Proof. If we choose g (x) — e x which is in P sep , then we can apply the lemma 
to find that 



f(x + t)e x - t dt = e ,< 



f(x + t)e _t dt G P. 



If we multiply by e x we get the conclusion. 



□ 



We now to assume that Ax = qx where q ^ 1. We can integrate polynomi- 
als in P A since they determine sequences of interlacing polynomials. Assume 
that f (x) G P A . It's clear that 



f (x) < f (tx) < f (qx) if 1 < t < q 



(8.10.2) 



Consequently we have an interlacing family {f (tx)}. Applying the results of 
§ 1313. 6| we conclude that 



Corollary 8.63. Suppose that w(x) is positive on (l,q). The mapping 



f i 



f (tx)w(t) dt 



(8.10.3) 



defines a linear transformation P A — ► P. 
Corollary 8.64. Iff G P A then D q (f) G P. 
Proof. Immediate from the computation 
•q 



f'(tx) dt = (f (qx) - f(x))/x = (q - 1) D q (f) 



□ 



We also have convolution. If f (x) G P A then replacing x by x/q in d8.10.21 l 
shows 



f (x/q) < f (x/t) < f (x) if 1 < t < q 



(8.10.4) 



Equations | |8.10.2|| and d8.10.4b are reversed if < q < 1 . We can now apply 
Lemma [8.601 to conclude 

Lemma 8.65. If f, g G P^ where Ax = qx, and w(t) is positive on (1, q) if q > 1 
and on (q, 1) if < q < 1 then 



w(t)f(xt)g(x/t) dt G P 
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8.11 Determinants and P A 

In this section we look at connections between P A and polynomials of the 
form |xD + C| with symmetric C and diagonal D. We show that there is an 
analog of the differentiation formula ([8.2.4b where the factors are replaced by 
determinants of principle submatrices. The rest of the section is concerned 
only with Ax — x + 1. 

We assume that D is an n by n diagonal matrix with diagonal entries en, 
and C = (Cij) is a symmetric matrix. We are interested in the matrix M = 
xD + C. If M[i] is the ith principle submatrix, then 

— det(M) = ai det(M[l]) + • • • + a n det(M[n]) (8.11.1) 
dx 

The formula for the affine derivative is similar: 

Lemma 8.66. Assume all are positive. IfM[ r ] is constructed from M[r] by apply- 
ing A to the first r — 1 diagonal elements o/M[r] then 

D A det(M) =aidet(M [1] ) + --- + a n det(M w ) (8.11.2) 
For example, 

/ QjX + Cn C 12 C B 

M — \ C\2 a 2 x + C22 C23 

V C13 C23 a 3 x + c 33/ 

'a 2 x + C22 C23 
C23 Q3X + C33, 



M [i] 
M [2] 

M[3] 



A(aix + c n ) C13 

v C13 a 3 x + c 33/ 

^(QiX + Cn) C12 

. C12 A(a 2 x + c 2 2! 



D A det(M) = m det(M[!] ) + a 2 det(M [2 ] ) + a 3 det(M [3] ) 

Proof (of \8.66h Note that the term atdet(ra[y ) is the determinant of the n by 
n matrix formed from M by setting all elements of the i-th row and column 
to zero, except for the main diagonal element, which is at. Since all terms of 
the equation are now n by n matrices, we can multiply by D _1 ^ 2 on each side, 
and so we may assume that all cn are equal to 1. We now apply the following 
lemma, with y = Ax, to finish the proof. □ 

Lemma 8.67. If C is an n by n symmetric matrix, and is the nby n 

matrix whose first k diagonal elements are y and the remaining ones are x then 

det(C + xI) -det(C+uI) 



det(C + Di)[l] +det(C + D 2 )[2] + •■■+ det(C + D n )[n] (8.11.3) 
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Notice that in the special case that M is a diagonal matrix the formula 
1 18.11.21 reduces to 4823}. The limit of 1 18.1 1.31 as y -> x is l l8.ll.ll . 

For the remainder of this section we restrict ourselves to Ax = x + 1 and 
psep y _ Y[[x — Ti) then the polynomials f/(x — TjJ are mutually in- 
terlacing. It is easy to find examples where the determinants of the principle 
submatrices of |xl + C| are not mutually interlacing. However, if |xl + C| is 
in P sep then we can find a family of mutually interlacing polynomials that 
generalize the f [ k] of d8.3.5b . 



Let M = xl + C and write M. = 



M[l] 



. Then 



A(x + cii) v 




X + Cll V 




1 


V* M[l] 




v t M[l] 


+ 


M[l] 



|M| + |M[1]| 



Since | M | < | M [1] | we see that 



|M|< 



ix + en, 



v 

M[l] 



If we let M[ r j be the result of applying A to the first r diagonal elements of M 
then inductively we see that 

|M.| ^ |M[i] | ^ |M [2 ] I ^ • • • <|M [n] | = A|M| 

Thus, if |M| e P sep then |M| < A|M| and consequently the following polyno- 
mials are mutually interlacing 

{|M|,|M [1] |,|M K || / "- / |M [n _ 1] | / A|M[} 

Every factor of a polynomial in P sep is in P sep , but the analogous property 
does not hold for principle submatrices. For instance, the matrix M below has 
the property that & ( | M [k] | ) < 5(|M|) f or k = 1,2,3,4,5. 



/x + 2 4 6 6 

4 x+2 5 8 
6 5x7 
6 8 7 x+2 
\ 5 6 6 4 

However, the roots are always distinct. 



5\ 
6 
6 
4 



Lemma 8.68. If M = xl + C zs a matrix whose determinant has all distinct roots, 
then the determinants of all principle submatrices have all distinct roots. 

Proof. The determinant of a principle submatrix interlaces the determinant of 
M. □ 
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8.12 The case Ax = qx with q > 1 

In this section we consider the affine transformation Af (x) = f (qx) where we 
first assume that that q > 1. We know that f <C Af implies that f £ P alt . There 
are simple conditions on the roots that characterize such polynomials. If the 
roots of f are {ai} then the roots of Af are {a-i/q}. If f G P alt then since q > 1 
the ordering of the roots is 



The ratio of consecutive roots is at least q. Conversely, if f £ P and the 
ratio of consecutive roots is at least q then f <C Af. 

The affine derivative is called the q -derivative, and is given in igTzTl The next 
lemma specializes the results in Section IB~51 



Corollary 8.69. Suppose Ax = qx where q > 1. The following are true: 

1. Iff G P A then f £ P alt . 

2. Iff £ P A then f G P 4 . 

3. Iff £ P A then D q f £ P A . 

4. Iff «g G P A then D q f < D q g. 

5. Ifge P alt andf £P A then g(A)f £ P A . 

6. Ifg G P alt andf £P A then g( D q )f G P A . 

For example, if we take g = (x — l) n then the following polynomials are in 



In order to apply Lemma 18.141 we need the following formula for higher 
derivatives that is easily proved by induction: 



Ql/q < Qi < Cl2/q < Q2 < • • • 






q u g(q x x). 



iff) is the Gaussian binomial coefficient defined by 




where as usual [n] ! = [n] [n — 1] ■ ■ ■ [1] . 
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Corollary 8.70. IfgGP and m is a positive integer then 

i=0 ^ ' 1 

Lemma [8.15l implies that 

Lemma 8.71. Sup-pose f eP A nP pos and q > 1. For aZZ positive x 

f(qx) f(x) „ 
f(q 2 x) f(qx) 

8.13 The case Ax = qx with < q < 1 

The cases q > 1 and < q < 1 are closely related. Let A q (x) = qx, and 
Ai/q (x) = x/q. Consequently, if f G P then 

f (x) < A q f (x) f (-x) < A Vq f (-x) 

The transformation x i— > — x is a bijection between P A for q > 1 and P A for 
0<l/q<l. 

For the rest of this section we assume that < q < 1. 
The fundamental polynomials in P A are 

(-x; q) n = (1 + x)(l + qx)(l + q 2 x) + • • • (1 + q^x) 

which have roots 

_q-(n-l] < -q-'^- 2 ) < . . . < -q- 1 < 1 

Note that D q (x;q) n = [n](x;q) n _i. 

We can translate the results of Lemma [8. 691 

Corollary 8.72. Suppose Ax = qx where < q < 1. The following are true: 

1. Iff G P A theni G P vos . 

2. Iff g P A theni' G P A . 

3. Iff G P A then D q f G P A . 

4. I/f «g g P A then D q f < D q g. 

5. I/ge P pos and f G P A fen g(A)f G P A . 

6. I/g G P pos andf G P A fen g( D q )f G P 4 . 
Lemma [8.15l implies that 

Lemma 8.73. Suppose f G P A n P pos and < q < 1. For all positive x 

f(qx) f(x] „ 
f(q 2 x) f(qx] 
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8.14 The q-exponential 

In this section we continue to assume that < q < 1. We will use the q- 
exponential Ea of (18.4.21 to derive properties of the q-Laguerre polynomial. 
We have already seen E q (x) in ( |5.1.H . E q (x) is an entire function if |q| < 1. Us- 
ing the product representation it is easy to verify that D q E q ( ax) = aAE q ( ax) . 

From 115.1. 11 we see that all the terms have positive coefficients, and the 
roots of the q-exponential Exp q (x) are {— q~ n }™ =1 , so clearly we have 

E q (x)< AE q (x) 

Thus, E q is an entire function that is in the closure of P A . 
The Rodrigues' formula for Laguerre polynomials is 

L n (x) = ^iD n (e-V) 
e x n! 

We define the q-Laguerre polynomials by the formula 

I^(x) = ^ ra D q -(E q (-x)(-x;qk) 

However, since E q (— x) ^ P A , it is by no means obvious that this is even a 
polynomial, let alone that it is in P A . To this end, define 

S(n) = {f -A n (E q (-x)) |f eP A } 
We can control the action of D q in S(n): 
Lemma 8.74. For any non-negative integer n, D q : S(n) — >S(n + l) 
Proof. Apply the q -Leibnitz formula to the product: 

D q (f -A n E q (-x)) = D q f -A(A n E q (-x))+f • D q (A n E q (-x)) 
= D q f -A n+1 E q (-x)-q n f -A n D q E q (-x) 
= ( D q f-q n f) -A n+1 E q (-x) 

Since D q f - q n f e P A by Lemma|H3]it follows that D q (f • A n E q (-x)) is in 
S(n + 1). □ 

Corollary 8.75. The q-Laguerre polynomials are in P A . 

Proof. Since (— x;q) n e P A , it follows that (— x;q) n • E q (-x) is in S(0). The 
Lemma above shows that D q ( [— x; q ) n ■ E q (— x)] is in S (n) . This means that it 
has the form g ■ A n E q (— x) where g is in P A , which proves the corollary 

□ 
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There is a simple formula for the q-Laguerre polynomials. 

There are several q -analogs of the Hermite polynomials 111031 . The variant 
that we use is motivated by the desire for a recursive formula similar to 07.8.61 1. 
We set H$ = 1, = x and 

H£ +1 =xHH-q n [n]H£_ v 

Since the coefficient of H^_ 1 is negative whenever q is positive, we see that 
H n has all real roots for all positive q. From the definition it is easy to show 
by induction that 

D q H< = [n] 

which is the desired analog of d7.8.6b . If we combine these last two equations 
we find 

H«=lx-q n ~ 1 D q )(x-q n - 2 D q q)...(x- D q )(l). 

which is the q-analog of the identity H n = (2x — D) n (l). 

The q-hermite polynomials can not be in P A since they have both positive 
and negative roots. However, they appear to be nearly so. See Question l83l 

8.15 q-transformations 

In this section we show that certain linear transformations associated to q- 
series map appropriate subsets of P to P. From earlier results specialized to 
the case at hand we have 

Corollary 8.76. Let Ax = qx where q > 1. 

1. The linear transformation x n i— ► q( 2 )x n maps P alt — ► P alt n P A . 

2. The linear transformation x n (— x; q) n maps P alt — > pd- 00 ) p 

We next study the latter transformation and its inverse for all values of q . 
Define 

T q (x l ) = (x; q)i = (1 - x)(l - qx) • • • (1 - q^x) (8.15.1) 
We will show the following: 
Theorem 8.77. J/T q is defined in d8.15.ll l, then 

1. IfO < q < 1 then T q : P™' — > P RMai) . 

2. If < q < 1 then T" 1 : P m] — ► P 10 - 1 '. 

3. 1/1 < q then T q : P W] — ► P W] . 



CHAPTER 8. AFFINE TRANSFORMATIONS OF POLYNOMIALS 



245 



4. J/1 < q then T" 1 : P MMai) — P™'. 

The idea is to discover the recursion relations satisfied by T q and T~ : . We 
will show that T( 1 / q ) = T^ 1 , and this allows us to prove the results about T^ 1 . 
From the definition 



Tqtx-x^ 1 ) = T q (x n ) = (l-x)(l-qx)(l-q 2 x)---(l-q n - 1 x) 
= (l-x)A((l-x)(l-qx)---(l-q— 2 )) 
= (l-x)AT q (x n - 1 ) 

which by linearity implies 

T q ( (x + b)f ) = (1 - x) AT q (f ) + bT q (f ) (8.15.2) 

Similarly, 

*T q (x-) = ^(T q (x-)-T q (x- +1 )) = T q {{l-x)A- 1 x n ) 

T- 1 (xT q (x n )) = (l-x)A-V 
V (xT q (g)) = (l-x)A- 1 g 

Setting g = T q _1 (f) yields 

T- 1 (xf) = (l-x)A- 1 T- 1 (f) 

Applying this last recursion to x n yields 

VV*) = (i-i)(i-i)-..(i-^) = W*») 

Consequently we have the equality T^ 1 = Ti/ q . This relation shows that 
part 1 of Theorem 18.771 implies part 4, and part 3 implies part 2. 
The next Lemma establishes properties of the recurrence l|8. 15.21 . 

Lemma 8.78. Suppose that < q < 1, b is positive, A(x) = qx, and define 

S(f) = (l-x)Af + bf. 
Then iff e P [1 -°° ] and A(f ) < f then S(f ) G P 11 - 00 ', and AS(f) < S(f). 

Proof. Since A(f ) <c f and all roots of f are greater than 1 it follows that (1 — 
x)A(f)<|;f. By linearity we know that (1 —x)A(f) +bf + yf € PforanyY,and 
consequently S(f) < f. This also shows that S(f ) G P. Since f € p 11 - 00 ' we know 
that S(f ) has at most one root less than 1. It's easy to see that the coefficients of 
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S(f ) alternate, so S(f ) <E P Q . We need to show that S(f ) has no roots in (0, 1). 
The leading coefficients of f and S(f) alternate in sign, and it's easy to check 
that f is positive asu — oo. Since all roots of f are greater than 1, it follows 
that f is positive on (0, 1). Similarly, A(f ) is positive on (0, 1). Finally 1 — x is 
positive on (0, 1), and so S(f ) is positive there. This implies that S(f ) € p' 1 - 00 '. 

It remains to show that A(S(f))<cS(f), and we do this by analyzing the 
location of the roots of S(f). Suppose that the roots of f are r\ < ■ ■ ■ < r n . The 
roots of Af are ^ < • • • < and since Af <f; f we have the ordering 

l<ri< — <r 2 < — <■■■ <r n < — 

q q q 

The signs of f and Af are constant on each of the intervals defined by consecu- 
tive terms of the above sequence. Now 1 — x is negative for x > 1, so the roots 
of S(f ) lie in the intervals where f and Af have the same sign. Since both f and 
Af have the same sign on oo) the roots of S(f ) are located in the intervals 

— ,r 2 ), — ,r 3 , ■•• ,r n , — ,oo 

q q q q 

Since S(f) < f there is exactly one root in each interval. The smallest possible 
ratio between the first two roots of S(f ) is found by taking the largest possible 
first root, x%, and dividing it into the smallest possible second root, ^. This 

ratio is jr, and the same argument applies to all the possible ratios. Since all 

consecutive ratios of the roots of S(f ) are at least j?, we find that ASff ) <C S(f ). 

□ 

Proo f of Theorem [87771 We begin with part 1, so assume that < q < 1 and 
f G P' 1 ' 00 '. Now T p ((x + b)f ) = S(T q (f)) by so by induction it suffices 

to prove our result for f linear, which is trivial. The case for f 6 P pos is similar 
and omitted. 

The proof of part 3 is entirely similar to part 2 - we need a modified ver- 
sion of Lemma \8. 78 1 - and is omitted. As observed above, the remaining parts 
follow from 1 and 3. □ 

8.16 Newton's inequalities for Ax = qx and Ax = x + 1 

Suppose Ax = qx where q > 0. The coefficients of a polynomial in P A satisfy 
constraints that are the q -generalizations of Newton's inequalities. The key 
observation is that P A is closed under reversal. 

Lemma 8.79. If Ax = qx, q > and f e then f Tev G P A 

Proof. If the roots of f are 

ti < r 2 < • ■ • < r n 

then the roots of f rev are 

11 1 
— < <•••<—. 



CHAPTER 8. AFFINE TRANSFORMATIONS OF POLYNOMIALS 



247 



Since f 6 P the ratio of consecutive roots of f satisfies Ti+i/r-i ^ q for 1 ^ i < 
n. The ratio of consecutive roots of f rev satisfies 

1Ak ^ 



1/ric+i 

and so f rev S P A . □ 
Proposition 8.80. If Ax = qxand we choose f (x) = ao + - • • + a n x n in P A (n) then 

4+1 (8-16.1) 



Q k a k+2 (q + 1) 2 [k + l][n-k-l] 

/n addition, 



<+1 >~^J^^ (8-16.2) 



a k Qk+2 q (q + 1) 2 [k+l][n-k-l 
Proof. We follow the usual proof of Newton's inequalities. Write 



f(*) = 2>U) x 

k=0 V / 1 



k 



where (£) = [n]!/([k]![n — k]!). The q -derivative of f satisfies 



k=i x 7 i 



,k— 1 



Consequently, if we then apply the q -derivative k times, reverse, apply the 
q -derivative n — k — 2 times, and reverse again, the resulting polynomial is in 

pA. 

[u]-..[n-k+l]fb k (T) + b k+i(T) x + b k+2 (T) x 2 ) (8-16-3) 



V U / q V 1 / q m ' 

After removing the constant factor, the discriminant satisfies 

(q+l) 2 b 2 +1 -4b k b k+2 ^0 

Substituting b k = a k /(£) and simplifying yields the first conclusion. 

Now suppose that < q < 1. The roots of i8. 16.31 1 are negative, and the 
coefficients are positive. Let 



-(q + l)b k+1 - J(q + l) 2 b 2 +1 - 4b k b k+2 

Tl = n 

2b k+2 



-(q + l)b k+ i + J(q + l) 2 b 2 +1 - 4b k b k+2 
2b k+2 
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Since J8.16.3b is in P A we know ri ^ q T2 so 



< tz - q ti = (q + 1) ( (q - l)b k+1 + J (q + l) 2 b 2 +1 - 4b k b k+2 /2b k+2 



Simplifying yields qb^. +1 ^ b k b k+ 2 which establishes the second part 



Remark 8.81. The product 



n-1 



H(x+q k )=X 



k=0 



n 



x~| I 

k=o 



□ 



(8.16.4) 



has Newton quotients 



1 [k + 2][n-k] 



q [k+l][n-k-l]' 
Numerical computation suggests that this is probably the best possible bound. 



Example 8.82. If q = 1 then4/(q + l) 2 = 1 and [n] = n, so J8.16.lb becomes the 
usual Newton inequalities 1 14.3.11 . Ifwetakef(x) = Qo + aix+Q2X 2 + a3X 3 e P A 
then 



k = l 
k = 2 



QT ^-.(1 + q + q 2 ) ^ 3 



a a 2 (q + 1) 



aia 3 (q + 1) 2 



(1 + q + q 2 ) ^ 3 



The bound 3 is the same as the usual Newton's inequality bound, and is real- 
ized if q = 1. If q ^ 1 then the bound is better. For example, if q = 2 then the 
ratio is at least 3 J. 

Although the reverse of a polynomial in P sep generally isn't in P sep , there 
are also inequalities for the coefficients of polynomials in P sep n P pos . The 
idea is to apply Proposition 13.221 to the interlacing f <J Af. For example, if 

f = ao H + CL3X 3 e P sep D P pos then from the Proposition the following is 

totally positive: 



f a\ + a 2 + a 3 2a 2 + 3a 3 3a 3 

ao ai a2 a3 

ai + a2 + a3 2a2 + 3a3 3a3 

a a a a 2 a 3 

V ; ; ; ; ; 
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from which we conclude that a n _i ^ (£) a n - 
8.17 The case Ax = — x 

In this section we look at the transformation A(x) = — x. This is a degenerate 
case, but the associated linear transformation is interesting. We can character- 
ize the elements of P A in terms of pairs of interlacing polynomials. 

Lemma 8.83. IfA{x) = — x and f(x)<f(— x) then there are f ,h inP pos such that 
fo < — fi and f = fo(x) fi(— x). Conversely, given fo,fi satisfying these conditions 
then f (x) fi (— x) is in P A . 

Proof. Write f(x) = fo(x)fi(— x) where fo(x) contains all the negative roots, 
and fi(— x) all the positive roots. Since f (— x) = fo(— x)fi(x) the result follows 
easily from consideration of the interlacing of f (x) and f (— x). □ 

The affine derivative of f (x) is (f (x) — f (— x))/2x. If we write 





f(x) 



Qq + QiX + Q2X 2 + a3X 3 H 



then the affine derivative is 



D A (f) 



- (qi + a 3 x 2 + a 5 x 4 H ). 



If we recall the odd part of f 



Ql + Q3Z + q 5 z 2 H 



then we can write 



D A (f) 




An unexpected consequence is that 



D A ( D A (f) ) = 0. 



We saw the odd part (and the even part) of f in § 1717.151 Since the roots 
of D A (f) are the square roots of the roots of f we see that in order to have 
D A (f) £ P we must have that the signs of D A (f) alternate. See Lemma [2. 61 
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One result of these computations is that if the degree of f is even then the 
degree of Da(t ] is two less than the degree of f, and so does not interlace f. It 
is also easy to verify that we do not have interlacing if the degree is odd. 

There is a linear transformation associated with A that maps P pos to P A . 
Define 

T(x n ) = x ■ Ax • A 2 x • ■ • A n_1 x = (-l)(?)x n 

Unlike linear transformations P — ► P (Corollary ll.50|l , the leading coefficients 
of this transformation neither alternate, nor have the same sign. 

Lemma 8.84. J/T(x n ) = (-l)(")x n then 

1 J.pPOSypPos > p 

2. I/f G P pos (2n) then (Tf)(x)< (Tf)(-x). 

3. I/f G P pos (2n + l) then (Tf)(-x)< (Tf)(x). 
Proof. We first get the recurrence relation for T. 

T(xx n ) = (-l)( n 2 +1 )x n+1 =x(-l)(?)(-x) n = x(Tx n )(-x) 

and by linearity 

T(xf)=x(Tf)(-x) 
T((x + a)f) = x(Tf)(-x) + a(Tf)(x) 

We use this recursion to prove the lemma by induction on the degree of f . 
For n = 1 we chose a > 0. Now T(x+ a) = x+ a so we have that — x+ a «x+a 
since a is positive. 

We will do one of the cases; the others are similar. Suppose that g = 
(x + a)f G P pos (2n). Then f G P pos (2n - 1) and by induction hypothesis 
(Tf)(-x) < (Tf)(x). We want to show that (Tg)(x)< (Tg)(-x). From the re- 
currence relation 

/ Tg(x) \ _ / (Tg)(x) \ _ / x(Tf)(-x) + a(Tf)(x) \ 
\A{Tq[x))J {(Jg)(-x)J ^-(x(Tf}(x) + a(Tf)(-x)y 

fx oW(Tf)(-x)\ 
" {a -xj { (Tf)(x) ^ 

Tg ^ ATg follows from Corollary 13. 541 since ATf <C Tf . 

□ 



CHAPTER 8. AFFINE TRANSFORMATIONS OF POLYNOMIALS 



251 



8.18 The case Ax = 



It is surprising that the degenerate case Ax = leads to interesting results. 
The class P A is empty, since Af = f (0) is a constant, and can not interlace 
f . However, there is a class of polynomials Ao that are based on the affine 
derivative and its conjugate. 

In this case the affine derivative is also called the lower truncation operator 
since the effect of Da is to remove the lowest term: 

x — 

D L (ao + aix H h a n x n ) = ai + a 2 x H h a n x n_1 

The conjugate of the lower truncation operator by the reversal operator is the 
upper truncation operator 

Du(ao + ii* H 1- <inX n ) = a + aix H h a n _ix n_1 

Neither of these operators preserve roots (See p. 16361 ). This leads us to con- 
sider a set of polynomials for which both operators preserve roots. 

Definition 8.85. Ao is the largest set of polynomials in P pos with all distinct 
roots such that D u (Ao) C A and D L (A ) C A . 

It is unexpected that there is a characterization of the members of Ao in 
terms of coefficients, and that this characterization is a kind of converse of 
Newton's inequalities (Theorem 14.81 1. 

Corollary 8.86. A polynomial f = Y_ a i xl i n P pos with all distinct roots is in Aq if 
and only if 115.4.11 holds. 

Proof. If f G Ao then by applying the right combinations of Du and Dl we find 
that a,_i + a^x + C4 + ix 2 is in Ao- Since all polynomials in Ao have all distinct 
roots the inequalities follow. 

Conversely, suppose that the coefficients of f satisfy (|5. 4.1| |. If we apply 
either Dl or Du to f then the resulting polynomial g has the same coefficients 
as f, except for either ao or a n . Consequently, g also satisfies the conditions of 
Theorem 15.191 and so has all real roots, all distinct. □ 

The class Ao satisfies the usual closure properties 

Corollary 8.87. Ao is closed under differentiation and the exponential operator. 

Proof. If f = Y_ is m Ao then we only need check that the coefficients of f ' 
and EXPf satisfy 1 15.4. It . This is immediate from 1 15.4.11 and 

i 2 a? ^4(i-l)(i + l)Qi_iQ i+ i 

a l > a a t-i at+i 
(i!) 2 " (i-l)!(i + l)! 

□ 
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It is not the case that Di_(f) and f interlace. In fact, they are as far from 
sign interlacing as possible. If r is a non-zero root of f € F ± , then Di_(f)(r) = 
— f (0)/r, and so D L (f ) has the same sign at all roots of f. 
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Chapter 




Polynomials in two variables 



In this chapter we generalize our results from polynomials in one variable to 
polynomials in two variables. Our goal is to generalize P and P pos , to define 
interlacing even though there are no roots, and to use these facts to deduce 
properties of polynomials in one variable. 

9.1 The substitution property and determinants 

All the polynomials that we will consider in this chapter satisfy a property 
called substitution. This is the analog of "all real roots" for polynomials in two 
variables. However, this alone is not sufficient; we will need another condition 
to generalize P. 

Definition 9.1. If f is a polynomial in variables x,y then f satisfies x- 
substitution if for every choice of a in M. the polynomial f (x, a) has all real 
roots, and the number of roots is the same for all a. We say that f satisfies 
substitution if it satisfies x-substitution and y -substitution. We let Sub2 be the 
set of all polynomials in two variables that satisfy substitution. 

A polynomial can satisfy x-substitution and not y -substitution. Figure [9TT1 
shows the graph f(x,y) = of such a polynomial of total degree 2. Every 
vertical line meets the graph in two points, so f satisfies y -substitution. Since 
there are some horizontal lines that do not meet the graph, f does not satisfy 
x-substitution. 

In the remainder of this section we discuss the substitution properties of 
polynomials that are defined by determinants. 

Example 9.2. We start with an example of a polynomial that satisfies a differ- 
ent kind of substitution. Recall that the eigenvalues of a symmetric matrix are 
all real. Choose symmetric matrices A, B and consider the polynomial 

f{x,y) = |I + xA + yB| (9.1.1) 
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Figure 9.1: The graph of a polynomial satisfying y but not x-substitution. 

where I is the identity matrix. We claim that f (x,y) satisfies the property: 

• For any a, (3 the polynomial f (az, 8z) has all real roots. 
Indeed, we see that 



Since ocA + (3B is symmetric, the roots of g(z) are given by — r~ : where r is an 
eigenvalue of aA + (3B. 

It is not the case that |I + xA + yB| satisfies substitution. Here's a small 
example: 



and the latter polynomial has two complex roots. Figure 19.21 shows why ev- 
ery line through the origin will meet the graph, yet there are horizontal and 
vertical lines that miss it. 

Example 9.3. We again start with equation 119.1. It . Now assume that 

are invertible and symmetric 



g(z) = f (az, (3z) = |I + z(aA + 6B) 




f(x,y) = |I + xA + yB| = 1 + 6x - 8x 2 + 4y - 20xy - 12y 2 
f(x,l) = -7- 14x-8x 2 



We claim that 

• For any a, (3 the polynomial f (az, (3z) has all real roots. 

• f (x,y) satisfies x and y substitution. 
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^^^^^ 

Figure 9.2: Meets all lines through the origin, fails substitution 



Choose ael and observe 

f(x,a) = |A| • |xl + (aA _1 B + A" 1 )) 

Since A _1 B and A are symmetric by hypothesis, aA _1 B + A -1 is symmetric. 
Thus the roots of f (x, a) are all real since they are the negative of the eigenval- 
ues of a symmetric matrix. For f(oc,x), notice that B _1 A is symmetric, since 
it's the inverse of a symmetric matrix. 

It is easy to find matrices that satisfy the assumptions of this example: take 
A and B to commute. For example, we take 

-1-1 1 \ 
-1 -3 
0-2 1 
-3 1 1 J 

and let B = A 2 / 10. Figure l93l shows the graph of the determinant, where the 
dot is the origin. The segments are not linear, even though they appear to be. 
It is clear that it satisfies substitution, and every line through the origin meets 
the graph in 4 points. 

Example 9.4. Next, we assume that A, B are symmetric and positive definite 
n by n matrices. We claim that f (x,y) in 1 19.1.11 satisfies these properties: 

• For any a, |3 the polynomial f (az, |3z) has all real roots. 

• f satisfies x and y substitution. 

• All coefficients of f are positive. 

The first part follows as before since A, B are symmetric. Next, A is positive 
definite, and so it has a positive definite square root. Let F_ 2 = A where E is 
positive definite. Factoring out |A| yields 

f(x,u) = |A| ■ xl + y E _1 BE _1 + A -1 
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This representation shows that f (x, a) has all real roots. Factoring out B shows 
that f(oc,u) has all real roots. The fact that all the coefficients are positive 
follows from a more general fact 11611 . 

Lemma 9.5. Suppose that A\,. . . ,A d are positive definite. All the coefficients of 
\x\A\ + • • • + x d A d | are positive. 

Proof. We prove it by induction, and it is immediate for d = 1. Assume all 
matrices are n by n. If I is a subset of {1,2, . . . ,n], and M is a matrix, then 
M [I] denotes the matrix composed from the rows and columns of M listed in 
I. Since Aj is positive definite, we may conjugate by A 1 , and so assume 
that Ai = I, the identity. Upon expanding the determinant we see that 

Ixil + (x 2 A 2 + • ■ ■ + x d A d )| = Y_ (*2A 2 + • ■ • x d A d )[I] 

i 

Now any principle submatrix of a positive definite matrix is positive definite, 
so all the terms in the sum are positive definite. Thus, by the inductive hy- 
pothesis, all the summands have positive coefficients, and so the lemma is 
proved. □ 

The following is a sort of converse to the above, in that it allows us to de- 
termine that the eigenvalues are positive from properties of the polynomials. 

Lemma9.6. IfAisamatrixand\l+xA\ e pP° s then A has all positive eigenvalues. 

Proof. If the eigenvalues of A are r \ , . . . , r n then 1 1 + xA | = \\ (1 — xr-i) • Since 1 1 + 
xA| is in P po s it has all negative roots, which implies that all Ti are positive. □ 

A consequence of this lemma is a kind of converse to Lemma [93] 

Corollary 9.7. If A, B are matrices, and f [x,y) = |I + xA + yB| is a polynomial that 
satisfies 
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1. All coefficients o/f are positive. 

2. f(x,0) S P. 

3. f(0,x) e P- 

then A and B have positive eigenvalues. 

Example 9.8. Here is another way of constructing matrices that satisfy substi- 
tution that we will study in detail later. We assume that A and C are symmetric 
n by n matrices, A is positive definite, and define 

f(x,y) = det(xA + yI + C) (9.1.2) 

We claim that 

• f (x,y) satisfies x and y substitution. 

• The coefficients of terms of degree n are positive. 

By assumption aA + C is symmetric, so f(<X,y) has all real roots, since its 
roots are the negative eigenvalues of aA + C. Let A = E 2 where E is positive 
definite. Since 

f(x,y) = |A| • IxI + yA" 1 +E" 1 CE" 1 

and E _1 CE _1 is symmetric, it follows that f (x, a) has all real roots. 

The polynomial f H (x,y) formed by the terms of degree n is det(xA + yl), 
and the roots of f H (l,y) are the negative eigenvalues of A. Thus, all roots of 
f (l,y) are negative since A is positive definite, and hence f H has all positive 
coefficients. 

We summarize the different matrices in Table |9~TI 



A 


B A _1 B 


C 


Sub. 


f H 


Coefficients 










positive 


positive 


sym. 


sym. 


I 








sym. 


sym. sym. 


I 


X 






posdef. 


posdef 


I 


X 


X 


X 


posdef. 


I 


sym. 


X 


X 





Table 9.1: Properties of Ax + By + C 



We can compute an explicit example of Lemma 19.51 that shows all terms 
are non-negative; this properly belongs in a later chapter since the matrices 
are only positive semi-definite. 

Lemma 9.9. Suppose that X = diag(xt), Y = diag(yi) are d by d diagonal matri- 
ces, and Vi, . . ., Vd are d-vectors. Let M = (v\, . . ^v^) 1 . Then 

\X + y 1 vlv + --- + y d v\v (l \= XpYj|M[I])| 2 

IC{l,..,d} 
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Proof. By induction it 1 suffices to evaluate |yivj;v + • • • + y d v^v<j|. But this can 
be written as 

IMtYMIHM^YIIMI^yj.-.ydlMI 2 

□ 

Corollary 9.10. If A is symmetric, |I + xA| = Ylifi + T i x )/ v i • • -> v & are d-vectors 
then 



f(x) 



H + xA + yiViVi + .-.+ydV^Vdl = ^ rr n ,• Y i 

ic{i djllieiU + ^J 

Proof. Diagonalize A and apply the lemma. 
If d = 2 we have 

1 1 + xD + yivivj; + y 2 v 2 v^| = 



Yi|M[I]| 2 



□ 



yiy2^ 

i<j 



V 2 ,i V 2/ j 



(l + XTiJtl+XTj 



(9.1.3) 



The statement for n = 3 is 

|I + xD + yiv^ + y 2 v 2 V2 + y 3 v 3 V3| 

f f f 

=f +yi y" — +y 2 y vii— — +y 3 y_ v§ t — — 

^— 'l+xn ^— ' l+xn ^— 'l + xn 
+ yiy2 ^~ 



Vl,i Vy 
V 2 ,i v 2/j 



+ yiy 3 ^2 

i<j 



V W v l,j 

v 3 ,i v 3/j 

V 2/ i v 2/j 

V 3 ,i v 3/j 



(l+XTi)(l+XTj) 
f 

(l+xrOll+xtj) 
f 

(l+xnJU + xTj) 



■yiy 2 y 3 Y- 

i<j<k 



Vl,i Vy Vl /k 
V 2 ,i V 2 ,j v 2/k 
V 3 ,i v 3/j v 3/k 



(l+XTOU+XTjJU+XTk) 



(9.1.4) 



9.2 Interlacing in Sub2 

Interlacing can be easily defined for polynomials satisfying substitution. 



1 thanks to Neil White for this argument 
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Definition 9.11. If f and g are polynomials that satisfy substitution, then we 
say that f and g interlace if for every real a the polynomial f + ag satisfies 
substitution. Using the definition of substitution, we can restate this in terms 
of one variable polynomials: 

If f, g e Sub2 then f and g interlace iff for all a £ M we have that 
f (x, a) and g(x, a) interlace, as do f(a,y) and g(a,y). 

We define < in terms of these one variable polynomials. 

If f, g G Sub2 then f < g iff for all a e K we have f (x, a) < g (x, a) 
andf(a,y) <g(a,y). 

For homogeneous polynomials in two variables we can verify interlacing 
by reducing to a one variable problem. 

Lemma 9.12. Suppose f , g e Sub2 are homogeneous and that the coefficients of f 
are all positive. The following are equivalent 

1. f<g 

2. f(l,y)<g(l,y) 

3. f(x,l)<g(x,l) 

Proof. Note that the first interlacing is in Sub2, and the last two in P. We 
can substitute x = 1 in the first interlacing to deduce the second, so assume 
f < 9(l/y)- By assumption f (l,y) € P . If the roots of f (l,y) areri, . . .,r n 
then the roots of f (a,y) are {ri/a}, and the roots of f (x, b) are {b/ri}. There are 
similar expressions for the roots of g. Since the roots are either all positive, 
we have f (a,y ) < g(a,y ) and f (x, a) < g(x, a). Thus f and g interlace in Sub2. 
Similarly we can show that 1 and 3 are equivalent. □ 

The relation of interlacing is preserved under limits. 

Lemma 9.13. Suppose ft, gt are in Sub2, and ft — > f, gt — > g where f, g are 
polynomials. If fi and gt interlace for all i then f and g interlace. 

Proof. Since f| + agi. is in Sub2 for all i, and fj, + agi converges to f + ag it 
follows that f + ag S Sub2- (See Lemma [10.3l ) Consequently f and g interlace. 

□ 

Substitution is not preserved under the operations of differentiation 2 . Con- 
sequently, we must restrict ourselves to a subset of Sub2. 



2 The partial derivative -M- of f = (xj + X2 + 1) (x^ + X2) (xj — X2) has imaginary roots for 
x = -1/4. 
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9.3 Polynomials in two variables 

We continue our journey toward the generalization of P to polynomials in two 
variables. For polynomials of one variable it is often important to know the 
sign of the leading coefficient. For two variables the homogeneous part is the 
analog of the leading coefficient. The homogeneous part will determine the 
asymptotic behavior of the graph, which is central to our generalization. 

If f is a polynomial in two variables, and we let x and y get large simul- 
taneously then the behavior of f(x,y) is determined by the terms of highest 
total degree. If the maximum total degree is n then 

f(x,y)= Y_ C M*V + Y. C ^*V ( 9 - 31 ) 

i+j=n i+j<n 

The homogeneous part f H is the first sum in 119.3. It . 

Definition 9.14. A homogeneous polynomial f(x,y) = Cox 11 + • ■ ■ + Cny 71 sat- 
isfies the positivity condition iff all ci are positive. 

Using the homogeneous part, we define the class P 2 of polynomials that 
forms the 2-dimensional analog of "polynomials with all real roots". 

Definition 9.15. P 2 (n) consists of all polynomials f of degree n such that f 
satisfies x-substitution and y -substitution, and f H satisfies the positivity con- 
dition. 

P 2 =P 2 (1)UP 2 (2)UP 2 (3)U... 

Sometimes the homogeneous part of a polynomial might be negative. For 
instance, if f(x,u) E P 2 (tl), then the homogeneous part of f(— %,— y) has all 
negative signs if n is odd. In this case, — f(— x,— y) e P 2 . If n is even then 
f (— x, — y) E P 2 . To simplify exposition, we just write f (— x, — y) 6 ±P 2 , which 
is true for every n. We also express this as ±f (— x, — y) e P 2 . 

It's useful to note that f H is not arbitrary. 

Lemma 9.16. Ift e P 2 then f H e P v ° s . 

Proof. See the proof of the more general result (Lemma llO.lll l. □ 

Here are a few elementary facts about P 2 and f H . 
Lemma 9.17. 

1. I/f(x,y) e P 2 then f(y,x) e P 2 . 

2. If also g(x,u) e P 2 , then f (x,u)g(x,y) e P 2 . 

3. If m, . . . , a n are positive, and \>\, . . . , b n are arbitrary, then 

(x+aiy + bj)(x+ a 2 y + b 2 ) • • • (x + a n y + b n ) e P 2 
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4. Ifg e P 2 then (fg) H = f H g H . 

5. Ifg € P2 has degree n — 1 then (f + g) H = f H . 

6. (fH)H =f H 

7 ^ 11 "\ H _9_ f H 

'■ \dx) ~ dx 1 

Proof, f (y,x) H is the reverse of f (x,u) H and f (y,x) satisfies substitution since 
f(x,y) does. The second part follows from the (fg) H = f H • g H . Since each 
factor x + cuy + b is easily seen to be in P2, the product is in P2. The rest follow 
easily from the definitions. □ 

Let's explore the definition of P2. We can write f (x,y) as a polynomial in 
either x or y : 



f(x,y)= f (x) + f 1 (xhj + --- + f n (x)u n (9.3.2) 
= f°(u) + f 1 (y)x + --- + f m (u)x m (9.3.3) 
Ly dl*V (9.3.4) 

The homogeneous part f H is ConU n + ■ • • c n ox n . Consequently n = m, 
and fnM and f n (x) are non-zero constants. The coefficient polynomials fi(x) 
have x-degree n — i since the total degree of f is n and the leading coefficient 
of ft is Ci jTl _i which is non-zero. We can summarize: 

A polynomial f g Sub 2 given in 119.3. 21 , 119.3.31 is in P2 iff 

• n = m 

• The degree of ft and f 1 is n — i. 

• The leading coefficients of f 0, . . . , f n (and f°, . . . , f n ) are posi- 
tive. 

Of course the coefficient polynomials are highly interrelated. We will see 
shortly that consecutive coefficients interlace. A simple property is that if f in 
l|9.3.2l l is in P2 then since f (x, 1) € P we have that 

f + fi + • • • + fn G P 

Iffl< ••• <f tx is a sequence of mutually interlacing polynomials then we 
know that, from left to right, the roots appear in reverse order: f n , . . . , fj. It 
follows that for any a € K there is an i such that 

f 1 ( a), . . . , f t ( oC] all have the same sign e 
f i+i (<*)/•••/ fn (oO a U have the same sign — e . 

We can use this observation to construct a family of polynomials that sat- 
isfy substitution and have all coefficients in P, but are not in P2. 
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Lemma 9.18. Suppose f i <C • • • <C f n. and gi <c • • • <c g n are two sequences of mu- 
tually interlacing polynomials. If 

H(x,y) = ^fiWgi(y) = hWy 1 = Y_ G ^ x ' 

then 

1. h[x, a) and h(a,y) are in P for all a € K. 

2. All Ft and Gi are in P. 

3. Consecutive F^'s and Gt's interlace. 

Proof. Note that h(x, a) = ^ fi(x)gi(oc). The observation above shows that 
gt(a) has one sign change, so we can apply Lemma [1.231 This establishes the 
first part. 

Since Fo = h(x,0) we see that Fo £ P. Now since {g'(y)} is a sequence of 
mutually interlacing polynomials we know that ^ ft (x) ("y ) satisfies the first 
part, and its constant term is V\. Continuing, all Ft are in P. 

Next, for any |3 £ K the sequence { (x + |3 ) gi (x)} is mutually interlacing. By 
the second part the coefficient |3Fi + Ft_i of y l is in P. It follows that Ft and 
Fi_i interlace. □ 

Although substitution holds for h, the degree condition does not. And, it 
is not true that h £ P2 (take ft = gi). However, by Lemma [21. 77l we know that 
h is stable. 

9.4 Inequalities for coefficients of a quadratic in P 2 

Let's look at the simplest non-trivial polynomials in P2. Consider a quadratic 
polynomial f (x,y) £ P2 where we write the terms in a grid 

ao2y 2 




Q01 y an xy 




QOO 110 X a20 x 2 

We have inequalities for the coefficients on each of the three outside lines, 
and the center square. 

Lemma 9.19. Iff £ P2 is a quadratic with the above coefficients, then 

^ a 2 ,! - 4a 2a o 
< a 2 -4a 2 oaoo 
^ a\ 1 - Aa 02 a 2 o 
^ aoiaio - a 00 an 
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Proof. Since f (x, 0) € P, the terms on the bottom row constitute the coefficients 
of a polynomial in P, and the inequality is just Newton's inequality Similarly 
for the left most terms. The diagonal terms are the coefficients of f H , which is 
also a polynomial in P. 

If we solve the equation for x, then the discriminant is 

A x = (aio+ fliiy) 2 -4a 2 o (a 00 + a iu + a 02 y 2 ) 
The discriminant of A x as a function of y is 

A y = 16 (a 02 aio 2 - a i a 10 an + a 00 a n 2 + a i 2 a 20 - 4 a 00 a 2 a 20 ) 
Since A x ^ 0, we know A y ^ 0. Rewriting A y < yields 

an (aio m - aoo an) ^ a 02 (a 2 - 4a 00 a 20 ) + a 20 

Now the all coefficients a 2 o, an, ao 2 are positive since f H G pv° s / anc [ by me 
above a 2 — 4aoo a 2 o ^ 0, so we conclude aio aoi — aoo an ^ 0. □ 

Remark 9.20. We can determine when each of the inequalities is strict. The first 
three are strict when the corresponding polynomials have all distinct roots. If 
ai a i - a o an = then 

= a 02 (a 2 - 4a o a 20 ) + a 20 a^ 

and both summands are zero by the lemma. Since a 2 o > we see aoi = 
0. Interchanging the roles of x and of y shows that aio = 0. Finally a 2 — 
4aoo a 22 = implies aoo = 0. We conclude that f (x,y) looks like 

ao 2 y 2 




Remark 9.21. Polynomials of degree two in P 2 are more than three quadratic 
polynomials spliced together. Consider the two variable polynomial 

16y 2 

8y 17xy 
1 2x lx 2 

The three polynomials on the boundary (x 2 + 2x + 1, x 2 + 17x + 16, x 2 + 8x + 16) 
have all real roots, but the quadrilateral inequality is not satisfied, so the two 
variable polynomial is not in P 2 . This phenomena will be considered later 
(p.|322). 

It is clear from the proof that if all four conditions of the lemma are satisfied 
then the polynomial is in P 2 . 
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Remark 9.22. We will see later (p. 1323b that the last inequality (the quadrilat- 
eral or rhombus inequality) is a consequence of interlacing of the coefficient 
polynomials. It can't be improved. For example, the diagram of coefficients 
of the polynomial (x + e y + 1) (e x + y + 1) is 

,2 




1 (1 

and the ratio (aoiaio)/(aoo a n) equals 

(1 + 



ex 



ex' 



1 + e 2 

and this goes to 1 as e — > 0. If r, s > and we let 

f{x,y) = [x+e(y + l)) r [y + e(x + l)) s (ex + y + l)(x + ey + 1) 

then as e — > the ratio (a riS+ ia T+ i /S )/(a riS a T+ i /S+ i) converges to 1. Thus, 
there are no Newton inequalities with constant greater than 1. 

The last inequality in Lemma [9.191 corresponds to the two adjacent trian- 
gles with a vertex in the lower left corner. The two other corners do not give 
the inequalities 



aioQn 
aoiQn 



Q01Q20 > 
ai a 02 > 



(right corner) 
(upper corner) 



(9.4.1) 



as we see by expanding (x + y — 1 

,2 



\2. 




We will see later (Proposition |10.41"1 that if the coefficients are positive then 
the inequalities J9-4.lt do hold. 

Example 9.23. We can use these ideas to show that perturbations of products 
aren't necessarily in J? 2- Let 

f = e + (x + ay + b) (x + cy + d) 

where a ^ c are positive. The discriminant A y is 16(a — c) 2 e. Since this is 
positive for positive e, we conclude that there are perturbations of products 
that are not in P 2 . 
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For example, if f = (x + y ) (x + 2y ) + e, then f e P 2 if e < 0, and f £ P 2 if 
O 0. 

Although it is difficult to realize a set of numbers as the coefficients of a 
polynomial in the determinant is the only restriction for a rhombus. 

Lemma 9.24. Suppose that ( £ d) ' s fl ma ^ x with positive entries. The following 
are equivalent: 

1. ad-bc>0 

2- ( £ = ( Z Z ) for some £ a tj x 1 y* e P£". 
Proof. We may assume that c = 1. It suffices to consider the polynomial 
f(x,y) = (1 +T!X + siy)(l +r 2 x + s 2 y). 

We want to find positive T\, si, r 2/ s 2 such that 

fa b\^/s! + s 2 r!S 2 +r 2 si\ 
\1 d) \ 1 n+r 2 )■ 

Now it is easy to see that 

(0, ad) = {ri s 2 +r 2 si | si + s 2 = a,T\ +r 2 = d,Ti,T2,si,S2 > 0}. 

Since b g (0, ad) by the determinant hypothesis we have found f e P^ 05 . □ 

There are inequalities for the coefficients of f(x,y) e P 2 that come from 
Proposition 15.381 

Lemma 9.25. Iff (x,y) = Y. o ^Mv 1 € P2 and < k < n then 

k 

^fi(x)fic- i (x)(-l) lc - i >0 



Proof. For any a we apply Proposition 15.381 to f ( a, y ) . □ 

The inequality isn't strict. If k = then then the sum is fo(x) 2 which takes 
on the value 0. 

9.5 Solution curves 

We have been discussing polynomials in P 2 without actually looking at them. 
The geometric perspective gives important information about these polyno- 
mials. This is not surprising, since the condition that distinguishes P 2 from 
Sub 2 concerns the structure of the homogeneous part, and the homogeneous 
part constrains the geometry of the graph. 



CHAPTER 9. POLYNOMIALS IN TWO VARIABLES 



267 



If f (x,y) is any polynomial, then the graph of f is defined to be 

G f ={(x/y)|f(x/y)=0} 

For example, if we choose a fifth degree polynomial 

f = (x + u)(l+x + 2u)(l + x + 3u)(2 + x + 3u)(2 + x + 5y) (9.5.1) 

then the graph of g = 4f + 3 |£ (see Figure l9~3t is surprising linear outside of 
the central region. We will see later that g is in P2. 




Figure 9.4: The graph of a polynomial in P2 

We can decompose Gf into n curves, where n is the degree of f. Define 
Ti : M — > K by setting rt(u) to the i-th largest root of f (x,y) — 0. Since f satis- 
fies x-substitution each is well defined. The roots are continuous functions 
of the coefficients, and so each X\ is continuous. We call rjx) a solution curve 
of f . These are the analogs of the zeros of a polynomial in one variable. We 
can write 

Gf = Ti(I)U'-UT n (i) 

Lemma 9.26. If f € P2 then asymptotically the graph Gf is approximately a collec- 
tion of infinite rays. The curve r-i(R) is asymptotically a ray with slope given by the 
i-th largest root of f H (x, 1). Consequently, for x,y sufficiently large Gf lies in the 
union of the upper left quadrant and the lower right quadrant. 

Proof. If the degree of f is n the polynomial f H (l,y) has n roots |3i, . . ., |3 n by 
Lemma [9. 161 Consider 

f(x,-y) =f H (x,u)+ Y_ c ii*V 

i+j<n 

For x large and y approximately |3fX we see that 

f(x,u)«f H (x,u) + 0(x— J ) 
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and hence for large |x| there is a root y of f (x,y ) — where y is close to PiX. 

Thus, for |x| large f(x,y) has n roots, approximately equal to (3ix, . . ., Pn.x. 
Since f H has all positive coefficients, the roots of f H (l,y) are negative, and so 
all the pi are negative. Each r\ (x) is in the upper left quadrant for x large, and 
in the lower right quadrant for x negative and |x| large. □ 

The polynomial y 2 — x 2 + 1 shows that x-substitution alone does not imply 
y -substitution. Theorem 19.271 remedies this situation by giving a condition 
that along with x-substitution implies y -substitution. This theorem is the best 
way to show that a polynomial is in P2. 

Theorem 9.27. Suppose that f(x,y) is a polynomial that satisfies x-substitution, 
and f H satisfies the positivity condition. Then f satisfies y-substitution, and is in P2. 

Proof. Since each solution curve n is continuous and and is asymptotically a 
ray that lies in the upper left quadrant or the lower right quadrant it follows 
that the graph of each rt meets every horizontal line, so the equation n (x) = a 
has a solution for every a. In particular, for any y there are n solutions to 
f(x,y) = since f (a,rt(a)) = for each i. Since f (x,y) has degree n in x this 
implies that f (x,y) satisfies x-substitution. □ 

Remark 9.28. It is important in the definition of substitution that the homo- 
geneous part has all positive coefficients. For instance, choose g, h 6 P, and 
define f(x,y) = g(x)h(y). The homogeneous part of f is a monomial x n y m , 
and does not satisfy positivity, and so is not in P2 The graph of f consists of n 
horizontal lines, and m vertical lines. We will see later that if g, h G pP° s then 
f is in the closure of P2- 

Here is an important way of constructing polynomials in P2. We will later 
show that these two conditions are equivalent. 

Lemma 9.29. If A and C are symmetric matrices, and A is positive definite then 

f(x,y) = det(xA + yI + C) e P 2 . 
Proof. This is Example l9.8l □ 

Lemma 9.30. If a solution curve off e P2 is horizontal or vertical at a point (a, b), 
then (a,b) is the intersection of at least two solution curves. 

Proof. If the solution curve is horizontal at (a, b) then — (a, b) = 0. Since a 

dx 

solution curve is implicitly defined by f (x,y ) =0, we can differentiate to find 
f x + ^f-y =0. This shows that f x (a,b) = 0. Consequently, the point a is a 
double point of the function f (x, b), and so (a, b) lies on two solution curves. 
The vertical case is similar. □ 

Lemma 9.31. The solution curves of a polynomial in P2 are non-increasing. They 
always go down and to the right. 
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Proof. If we ever had a solution curve that was increasing, then it would have 
a local minimum since it is asymptotic to a line of negative slope. A horizontal 
line slightly above this minimum would intersect it in two distinct points, 
contradicting the definition of solution curve. Similarly, considering the curve 
as a function of y we see it can never go left. □ 

Lemma 9.32. If i is a polynomial in P2 then f (x,x) e P. More generally, f (ax + 
b, cx+ d) is in P for any a, b, c, 6. such that a/cis either greater than the largest root 
o/f H or less than the smallest root o/f H . 

Proof. Since each V{ (R) is connected and asymptotically is a line with negative 
slope, any line of positive slope intersects T-j.(R). See Figure 1931 This proves 
the first part. 

Under the hypothesis on a/c any solution curve either has asymptotes 
whose slopes are both larger (or both smaller) than a/c. Any line with slope 
a/ c will intersect such a curve. □ 




Figure 9.5: The intersection of f (x,y) = and y — ax + b 



The hypothesis on a/c are necessary. In Figure 1931 it is easy to find lines 
that do not intersect the the solution curve whose ends are marked "*". 

The proof of the following theorem requires polynomials in four variables, 
and is proved in Theorem 1 10. 601 

Theorem 9.33. Ifi € P2 then for positive a, b, c, d we have 
f ( ax + by + u, cx + dy + v) e P 2 
As an example, notice that if f e P 2 then f (x, x + y) € P2. 
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In addition to considering the intersection of the graph of f with lines we 
can consider the intersection of f with the hyperbola xy — — 1. These intersec- 
tions correspond to solutions of f (x, — 1/x) = 0. Since this is not a polynomial, 
we multiply by x n where n is the degree of f. The degree of x n f (x, —1/x) is 
2n. 

For example, if f e P pos (n) then F(x,y) = y n f(x/y) is in P2. Obviously 
x n F(x, — 1/x) = (— l) n f (— x 2 ). Since all roots of f are negative, each root of f 
gives rise to 2 roots of x n F(x, —1/x) and so all 2n roots are accounted for. 

The reverse of one variable requires a negative sign. If we use 1 /x instead 
of —1/x we get stable polynomials. (See p. 16201 ) 

Lemma 9.34. J/f e P 2 (n) then x n f(x, -1/x) e P(2n). 

Proof. Since all asymptotes of f have negative slope the graph of xy = — 1 
meets the graph of f in n points in the upper left quadrant, and n times in the 
lower right quadrant - see Figure l9~6l This gives 2n solutions to f(x, —1/x) = 0, 
and since this is the degree of x n f (x, —1/x) the conclusion follows. □ 



Figure 9.6: The intersection of f (x,y) = and xy = — 1 



Remark 9.35. Harmonic functions are an important class of functions. A func- 
tion is harmonic if it is the real part of an analytic function. As long as the 
degree is at least two then no polynomial in P2 is harmonic. A geometric ex- 
planation is that if f is a harmonic polynomial then the real part of f (x + iy) 
has asymptotes given by the 2n rays with angles 7t(2k + 1 ) /2n, k = 1, . . . , 2n 
([69 1). Thus, the real part has asymptotes whose slopes are positive and nega- 
tive, and hence is not in P2. 

Example 9.36. Recall (Example 19.2b that polynomials of the form f(x,y) = 
I + xA + yB|, where A, B are n by n symmetric matrices, satisfy a strong line 
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intersection property: every line through the origin meets the graph of f in n 
points. This fails for polynomials in P2. For instance, if we define 



f(x,y) 



4 + x + 2y -3 

-3 -2 + x + y/2 



-17 + 2x + x 2 -2y + +V 2 



then f(x,y) 6 P2, and it is clear from the graph of f (Figure l9~71 l that there is a 
range of lines through the origin that do not meet the graph of f . 




Figure 9.7: Satisfies substitution, fails to meet all lines through the origin 



9.6 Interlacing in P2 

Definition 9.37. If f, g S P2 then f and g interlace iff f + ag is in ±P2 for all a. 
If f and g have the same degree then it is possible that (f + ag) H has negative 
coefficients. That's why we we require that f + ag € ±P2, which we recall 
means that either f + ag 6 P2, or — (f + ag) £ P2. If in addition the degree of f 
is one more than the degree of g then we say f < g. In this case (f + g) H = f H , 
and so the only condition we need to verify is substitution. This leads to an 
equivalent definition in terms of substitutions: 

If f, g £ P2 then f < g iff for all a £ K we have f (x, a) < g(x, a) ( or 
for all a £ K we have f (a,y ) < g(a,y)). 

Figure l9~8l shows the graphs of the two interlacing polynomials 

f = (x + y + l)(x + 2y +3)(x + 5y +4)(x + 3y +2) 

and + • From this we see that there really is a geometric interpretation to 
interlacing: if f < g then the solution curves of g interlace the solution curves 
off. 

We define strict interlacing f < g to mean that f < g, and the graphs of 
f and g are disjoint. It is easy to see that this is equivalent to saying that 
f(x, a) < g(x, a) for all ael. 

The following lemma covers the fundamental properties of interlacing in 

P 2 . 
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Figure 9.8: The graphs of two interlacing polynomials in P2 



Theorem 9.38. Iff, g,h G P2 and f < g, f < h, a, (3 positive then 

• ccg + |3h e P2 

• f <jctg + |3h 

Proof. Lemma 11.101 shows that <xg + f3h satisfies substitution. Since (ag + 
|3h) H = ag H + |3h H it follows that ag + |3h satisfies the positivity condi- 
tion. □ 

Theorem 9.39. P2 is closed under differentiation. Iff 6 P2 then f<-§£rfor 1 ^ i ^ 
2. 

Proof. Choose a S M, and consider g = f + a-^-. Since g H = f H , we will show 
that g G P2 by showing g satisfies substitution. If we choose |3 e R then 

3f 

g(x,p) =f( X/ p) + a— (x,(3) 
0x1 

Since derivatives in one variable interlace, it follows from Theorem 19.271 that 

g e P2, and and that f < -p-- □ 

^ — 0x1 

One of the surprising things about this definition is that interlacing linear 
polynomials are highly constrained. In P, any two polynomials of degree one 
interlace. In P2, the only way that degree one polynomials interlace is for them 
to be essentially one-dimensional. Suppose that ax+by +c <C sx+ty +u where 
Q,b, s, t are positive. The requirement that (ax + by + c) + <x( sx + ty +u) lies in 
P2, implies that sx + ty is a multiple of ax + bx. If we define f (x) = x + c and 
g (x) —x + t/a then we can express these two interlacing polynomials in terms 
of interlacing one variable polynomials. Namely, f(ax + by) — ax + by + c, 
ag(ax + by) — sx + ty + u, and f g. Thus there are no intrinsically two- 
dimensional interlacing linear polynomials. Geometrically, all this means is 
that the graph of interlacing linear polynomials must be parallel lines. 
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A consequence of this observation is that there appears to be no simple cre- 
ation of mutually interlacing polynomials. We have seen how to create mutually 
interlacing polynomials using linear combinations of the products of n— 1 fac- 
tors from a polynomial with n factors. The fact that this set of polynomials is 
mutually interlacing follows from the fact that any n degree one polynomials 
in P are mutually interlacing. 

Interlacing polynomials can be decomposed. This fact follows from from 
the assumption that the polynomials f + ag all lie in 1LP2, and hence the coef- 
ficients of the homogeneous part all have the same sign. 

Lemma 9.40. Suppose f, g G P2, f and g have the same total degree, and f, g inter- 
lace, then 

1. f H and g H are scalar multiples of each other. 

2. There is an r G P2 and positive a such that 

a) f <t 

b) g = af ± r 

Proof. First of all, we note that (f + ag) H = f H + ocg H , so f H and g H interlace. 
Since f + ag G P2 for all a, it follows that (f + ag) H G P pos for all a. From 
Lemma 12.141 there is a y such that f + yg is not in P pos , so the second case of 
Lemma \2. 141 is not possible. Thus, g H is a constant multiple of f H . 

By the previous lemma we can choose a so that g H = af H . We define 
r = ±(f — ag) where the sign is chosen to make the coefficients of r H positive. 
All linear combinations of f and g are in P2, so r is in P2. Moreover, the total 
degree of r is less than that of f since we removed all the highest degree terms. 
Also, f and r interlace since their linear span is the same as the linear span of 
f and g. □ 

We use this definition to define <|C for P2. Say that fffCg iff there is an 
r G P2 such that g = af + r for some positive a, and f <r. 

9.7 Linear transformations on P 2 

We study linear transformations on P2. Just as in one variable, linear transfor- 
mations preserving P2 preserve interlacing. The proof is immediate from the 
definition of interlacing. 

Theorem 9.41. Suppose thatT: P — > P2 and S: P2 — ► P2 are linear transforma- 
tions. If f, g G P interlace then Tf, Tg interlace. If f , g G P2 interlace then Sf, Sg 
interlace. 

Corollary 9.42. Iff<gare in P2, and a, b are positive then 

dx dy dx dy 

Ifi <g then£-<£- 
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Proof. The map f i— ► + b|i maps P2 to itself, so it preserves interlacing. 
The second part follows form the one variable result. 

□ 

Lemma 9.43. Iff [x,y) < g{x,y) in Pi then f (x,x) < g(x,x). 

Proof. The map y 1— ► x is a linear transformation from P2 to P, and so it pre- 
serves interlacing. □ 

A linear transformation P — ► P sometimes induces a linear transforma- 
tion P2 — > P2- Analyzing the geometry of the graph of the image of a polyno- 
mial under this induced transformation yields information about new trans- 
formations on P or P pos . 

Theorem 9.44. Suppose that T : P — ► P is a linear transformation that preserves 
degree, and maps polynomials with positive leading coefficients to polynomials with 
positive leading coefficients. The induced linear transformation 

T*(xV) =T(x i )y^ 

defines a linear transformation from P2 to itself. 

Proof. Suppose f = Y. ^i{x)y l where ft has degree n — i. Since T preserves 
degree, the degree of T(ft) is n— i, and so (Tf ) H is a sum of terms x n ~ l ( leading 
coefficient of Tft). Since T preserves the sign of the leading coefficient, (Tf ) H 
has all positive terms. 

By Theorem l9.27l it suffices to show that (f ) satisfies x-substitution. If we 
choose b e I then 

n*f)(x,b)=T(f(x,b)) 

Since f(x, b) e P we know T(f(x, b)) is in P, and so T* satisfies x-substitution. 

□ 

We will revisit induced transformations in Chapter [1111 1.121 

Remark 9.45. It is important to see that a linear transformation T: P — > P does 
not in general induce a linear transformation P2 — ► P2- The assumptions of 
degree and positivity are essential. The proof shows that substitution will 
always be met. For instance, consider 

T: g i-> g(D)x. 

This satisfies T(l) = x, T(x) = 1 and T(x k ) = for k > 1. Since 

T(x + y) 2 =T(x 2 + 2xu+u 2 ) =2u + xu 2 
we see that T(x + y) 2 ^ P2 since the homogeneous part is xy 2 . See § 111111.121 
Corollary 9.46. -T/T(xV) =H i (x)H j (y) then!: P 2 — ► P 2 . 
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Proof. The maps x 1 1— ► Hi (x) and y ' i— ► H 3 - (y ) map P to itself by Corollary l7.44l 

□ 

We can make use of a special function identity to verify Corollary l9.46l in a 
special case. We have 



i=0 



i=0 

= 2 n/2 H 



x + y " 



and the last polynomial is in P2 since H n is in P. 

Theorem |9.44| is not true 3 if we only have that T: P pos — ► P. 

Lemma 9.47. Consider the maps T: f(x) 1— * f(x + y) and S: f(x) 1— * F(x,y), where 
F(x,y) is the homogeneous polynomial corresponding to f. These are linear transfor- 
mations T: P — >Y 2 andS: P ± (n) — ► P2N. 

Proof. The map T is linear and preserves degree, and S is linear if we restrict 
to polynomials of the same degree. □ 

Corollary 9.48. The linear transformation T(x 1 -y' ) = £il maps P2 to P2. 

Proof. Apply Theorem 19 .441 to the exponential map in each variable. □ 

There are a few instances where we can describe all the coefficients of y 1 
in a polynomial belonging to P2. Here are three examples - in each case the 
left sides are of the form f (x + y) so the right hand sides are in P2. H n is the 
Hermite polynomial, and the identity is a consequence of the Taylor series and 
the identity H^ = 2nH n _i. 



\ — n \ / 



H n (x + y)=^2 k r H 



i=0 

k 



If we expand f (x + y ) in its Taylor series then for any f e pP os 



f(x + y)=^f' i '(x)|eP 2 (9.7.1) 



3 For example, we saw in § 1717.71 that T{x n ) = (x) n is such a map. If we set f = (1 + 
x + ij){1+2x + ij){2 + 3x + ij) then the coefficients of T*f interlace but T*f does not satisfy 
x-substitution since (T„f ) (3,11 ) has imaginary roots. 
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A linear transformation from P to P preserving P pos determines a linear 
transformation from P to P2. 

Lemma 9.49. Suppose T : P — ► P is a linear transformation that preserves degree, 
and maps P pos to itself. IfT(x x ) — ft and we define S(x x ) = y l fi(x/y), then 
S:P pos ^P 2 . 

Proof. Suppose that g = Y.^=o a ^ xX is in P pos . The homogeneous part of S(g) 
is a n S(x n ) = a n y n f n (x/y) whose homogenization is in P pos since T maps 
P pos to P pos . To verify substitution, choose a and consider 

S(g)(x,a) = Y_ a-iS{x l ){x, a) = Y_ ai<x l fi(x/a) 
This last polynomial is in P pos , as the diagram shows 

ppos x^txx ^ ppos 

S(g)(x,a) 



ppos < 



. ppos 



□ 



Example9.50. If we choose the affine map T(f) = f(x+l) then S(x x ) = (x+u) 1 , 
andS(f) =f(x + y). 

Lemma 9.51. If J: P — > P and T(x n ) = p n/ then for any a the linear transforma- 
tion S(x k ) =x n ~ k p k (a) mapsP ± (n) — > P(n). 

Proof. The proof follows from the diagram 

1 . . homogenization 

P* (n) > P 2 



s 

V 

P(n) 



T. 



P 2 



□ 



9.8 Applications to linear transformations on P 

We now apply results about P 2 to get information about linear transformations 
on P. 

Corollary 9.52. Assume that the linear transformations T, S : P — ► P map x n to 
a polynomial of degree n with positive leading coefficient, and maps pv° s to itself. 
. Define a transformation V on P[n) by V(x x ) = T(x x ) S(x n ~ x ). Then V maps 
P pos (n) to itself. 

Proof. V is the composition 
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„r.nc, * homogenize , 

P pos n > P 2 (n) 



v 



¥ 



T.xS, 



(x,y)>-»(x,x) , 

P(n) -= P 2 (n) 

where T* and S* are defined in Theorem l9.44l The conclusion follows form 
Lemma 19321 □ 

Corollary 9.53. If H^fx) is the Hermite polynomial then the transformation x 1 
HJx) H n _t(x) maps P pos (n) toP(n). IfLi[x) is the Lagnerre polynomial then the 
transformation x 1 U(x) L n _t(x) maps P pos (n) to P pos (n). AZso, f/ze transfor- 
mation x i i ► U(x)H n _i(x) maps P pos (n) toP(n). 

Corollary 9.54. Assume that the linear transformation T: P — > P ?«flp x n to a 
polynomial of degree n icz'f/z positive leading coefficient. The linear transformation 

Proof. Take S to be the identity in Corollary |9.52| □ 

The next result follows from the corollary, but we will refine this result in 
LemmalHO] 

Lemma 9.55. The transformation x l i— > Hi(x)x n ~ x maps P pos (n) to P(n). 

Lemma 9.56. Suppose that f € P, fl«<i T: P — ► P. Then 

T«(f(x + y))(x,x]6P 

Proof. We know that f (x+y ) € P 2 , and so T* (f [x+y )) € P 2 . Apply Lemma l932l 

□ 

Corollary 9.57. Suppose T: P — > P z's a /inear transformation that preserves degree 
and maps P pos to ztoeZf. T/ze Mobius transformation T 1/z satisfies \/ z : p pos — ► 

ppos 

Proof. LetTfx 1 ) = f t (x), and recall that Tj/Jx 1 ) = fi REV . Let S(x l ) = y x h{x/y) 
as in Lemma IHU Notice that S(x n )(l,y) = y n f n (l/y) = T 1/ z (y) and so 
S(f)(l,y) = T 1/z (f). Now S maps P pos to P 2 by Lemma EM and hence 
T 1/z (f)=S(f)(l,y)eP. 

See (p. 15121 for a different proof. □ 

Corollary 9.58. The linear transformation x i t-> Li REV (-x) maps P pos — ► P pos . 

Proof. Apply Corollary 19371 to Lemma l"l5.41l □ 

The reason that we can substitute x = y in a polynomial in P 2 is that the 
line x = y must intersect every solution curve. The line x + y — 1 does not 
necessarily meet every solution curve, but an easily met assumption about the 
homogeneous part will allow us to substitute 1 — x for y. 



CHAPTER 9. POLYNOMIALS IN TWO VARIABLES 



278 



Lemma 9.59. Suppose T: P — ► P maps P pos (n) to itself, and define S(x x ) — 
(l — x) n ~ x T(x x ). Assume that the leading coefficient ofT(x x ) is d, and write f (x) = 
Y_ a.iX x . Iff € P v ° s and either condition below is met then S(f ) e P. 

• all roots ofY_ diCtx 1 are greater than —I. 

• all roots ofY. QiCtx 1 are less than — 1. 
Proof. S is determined by the composition 

ppos (n) homogenize ^ 



P(n) - P 2 (n) 

If we choose f € P pos as in Corollary 19 .521 we know T*(F) € P 2 , where F is 
f homogenized. The asymptotes of T* (F) have slopes that are the roots of the 
homogeneous part of (F), which is Y_ <iiCiX v . If either condition is met, then 
all solution curves meet the line x + y — 1 . □ 

The linear transformation T: x k Hk(x)x n ~ k acting on P(n) becomes 
very simple if we make a transformation. We need the identity 

(YX 

T(x-2 + a) n =H n ( — ) (9.8.1) 
If we substitute a = then 

T(x-2r = FW0) = {° n ifniS ° dd 
; |(-l) n / 2 (n-l)! if n is even 

This degenerate behavior at x = 2 is reflected in the following lemma. 
Lemma 9.60. The linear transformation T: x k i— > H^(x)x n ^ ]c acting on P(n) maps 

p(2,oo) y p(— oo,2) ^ p 

Proof. Consider the transformation S(f) = T(f(x — 2)) acting on P(n), and let 
W(x k ) = H k (x). The key observation (from is that 

S(x k ) =T(x-2) k =x n - k H k (0) (9.8.2) 

Therefore, if g = Y. a i x \ an d the homogenization G is g, then 

so we get the basic relation 

S(g)(y)=W,(G)| x=0 
If g e P ± , then G is in P 2 , and W*(G) € P 2 , so S(g) e P. Therefore, T(g) e P. 

□ 
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Remark 9.61. We can use 1 19.8. It to get precise information about the possible 
range of S in l|9.8.2b . Suppose that I n is the smallest interval containing all the 
roots of H n . The roots of H n (y) lie in the interval ^I n . If we knew that T 
preserves interlacing then from Lemma 16.211 we conclude from 1 19.8. Il l that if 
< a < 1 then 



9.9 Properties of P2 

The main result of this section is that the coefficients of a polynomial in P2 are 
in P. This allows us to prove that a polynomial is in P by identifying it as a 
coefficient of a polynomial in P2. 

Corollary 9.62. If f 6 P2 then all coefficients ft in J9.3.2I I and all coefficients f 1 in 
( 19.3.3b are in P. Moreover, ft < ft+i and f 1 <f 1+1 for ^ i < n. 

Proof. If f e P 2 then A (x,0) = i!ft(x) and hence f t e P by Theo- 

rem !9.39l and substitution. 

Since f + aj^ is in P2 for any a the coefficient of y l is in P. This coefficient 
is f i + af i+i and so we conclude that f i + i and ft interlace. Since the degree of 
ft is greater than the degree of fi+i we find fi <ft+i. □ 

There is a simple condition that guarantees that all the coefficients strictly 
interlace. 

Lemma 9.63. Iff{x,y) e P 2 and f (x, 0) has all distinct roots, then 

1. All fi have distinct roots. 

2. fi < fi+i 

Proof. Assume that fo(r) = fi(r) = 0. Let g{x,y) — f(x + r,y), and write 
9 = L SiMy" = L 9HvW = L cUjxV. Since g (0) = gi(0) = 0, we find 
io,o = 10,1 = 0. Thus, g° is divisible by x 2 . Since g 1 interlaces go, g 1 is divisible 
by x. This implies that 0.1,0 = 0, and hence go has a double root at zero, but this 
contradicts the hypothesis that fo has no repeated roots since go(x) = fo(x + r). 
Since fo < fi, it follows that f 1 has all distinct roots. Continuing, we see all fi 
have distinct roots. □ 

The polynomial f(x,y) = (1 + x + y) (1 +2x + y) shows that f(x, 0) can have 
all distinct roots, yet f (0,u) can have repeated roots. 

Corollary 9.64. If f e P2 and f < fi then f t < f i+1 for ^ i < n. 

Lemma 9.65. Iff given in (19.3.2b satisfies y-substitution then the sequence of coeffi- 



cients is log concave: f fi < Ofor ^ i ^ n — 2. I/f and fj have no roots in 



common then 



fi + 1 fiH 



< 
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Proof. Since f has all real roots for any fixed x, we can apply Newton's inequal- 
ities (CorollaryED to find that f?(x) > fj_i(x)f j+ i(x) unless f (r) =fi(r) =0 
for somer in which case we only have f?(x) ^ fj_i(x)fj + i(x). □ 

Corollary 9.66. Set 

n n 

f (x,y) = ]J{x + b t + c iV ) = Y_ fj Mv ] 

i=l i=0 

Assume that the d are positive for 1 < i < n. Then the coefficients f j are polynomials 
of degree n — j, and 

f <fi<f 2 < ... 
If flZZ b| are distinct then the interlacings are all strict. 

Proof. Lemma 19.291 shows that f € P2. By Corollary 19.641 it remains to show 
that f < f 1 . If we expand f we find (see Lemma [9 .1031 1 

t ( \ V~ f oM 

fiM = 2_ c i 



x + bi 



Since all coefficients c, are positive we can apply Lemma[L20]to conclude that 
f < f 1 since fo has all distinct roots. □ 

Corollary 9.67. Ifi — Y_ ^iMv 1 and g — Y. 9ii x )v x are both in P2(n) then 

^fi(x)g n _i(x) £P 

i 

Proof. The polynomial in question is the coefficient of y n in the product f g. 

□ 

Note that if we take f = Y. aiX n ~ l y l and g — Y. 9i xTl l V x where Y. a i xl 
and bi* 1 are both in P pos then we conclude that £j a n _ibtx 21 6 P. This is a 
simple modification of the Hadamard product. 

Remark 9.68. Since the first two terms of the Taylor series (|9.7.1|l of f {x+y ) are f 
and yf it follows that f < f '. Thus, we can prove Rolle's theorem ("Theorem ll.H 
without analyzing the behavior of the graph of f and f. 



pos 



9.10 The analog of P 

A polynomial in P with positive leading coefficient is also in P pos if and only 
all of its coefficients are positive. We define Pj in the same way: 

ppos — |f g p 2 j a n coefficients of f are positive} 
We start with a useful alternative criterion. 
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Lemma 9.69. Suppose f e P 2 and write f = ft(x)y\ Ifi G P vos then f e V p 2 . 

Proof. Since ft(x) <fi+i(x) and since fo(x] G P pos it follows that all ft are in 
ppos jsj ow leading coefficients of all ft's are terms of f H and are therefore 
positive. Consequently, all f j/s have all positive coefficients, and thus f (x,y) G 
pP° s (n). ' □ 

We discuss the generalization of P pos to more than two variables in 
§ 110110.41 and more properties of P^ 05 can be found there. 

Our next result is that a simple translation takes a polynomial in P 2 to one 
in PP° S . 

Lemma 9.70. If f (x,y ) G Pi[n), then there is an a such that 
f(x + a,y)GPf». 

Proof. If f (x,y) = ft My 1 / then we can choose a such that fo(x + a) is in 
P pos . It follows from Lemma HSH that f (x + a,y) G Pf 8 . □ 

Interlacing of polynomials in stays in F^. 
Lemma 9.71. J/f G F p 2 ° s and f < g then g G F p 2 ° s 

Proof. If we write f = ^ fi(x)y l and g = Y. 9iMv l then we know that fg^ go- 
Thus go G P pos and the conclusion follows from Lemma 19.691 □ 

If f is in P2 then f (x, ax) G P for positive a, but this can be false for negative 
a. However, it is true for P!? 05 ". 

Proposition 9.72. If f (x,y ) G P^° s (n) then f (x, ax) G P for all a. 

Proof. If suffices to consider a < 0. We will show that the line y = ax meets 
the graph of f in n points. Without loss of generality we may assume that 
there are r roots of f H (x, 1) greater than a, and n — r less than a, for some r, 
where ^ r ^ n. 

Consider the upper left quadrant. Recall that the solution curves are 
asymptotic to lines whose slopes are the roots of f H (x, 1 ) . Thus, there are n — r 
solution curves that are eventually above the line y = ax. Since f G P^ 05 , each 
of these solution curves meets the x-axis in (— 00, 0) which is below the line 
y = ax. Thus, there are n — r intersection points in the upper left quadrant. 

Similarly, there are r intersections in the lower right quadrant. We've 
found n intersections, and the proposition is proved. □ 

Our next result is that all the homogeneous parts of a polynomial in Pj 08 
are in P pos . There are simple counterexamples that show that this is not true 
for arbitrary polynomials in P2. The proof uses properties of homogeneous 
polynomials in three variables - see Lemma [l0.37l 

Lemma 9.73. Suppose f G V\ s (n) and g r is the polynomial consisting of all terms 
of total degree r. If we set ti r (x) = g r (x, 1) then h r G P pos for 1 ^ r ^ n and we 
have interlacings 

h n <h n _! < H n _2 < • • • 
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If f € P2 then the graphs of f and f H have the same asymptotic behavior. 
Although these asymptotes do not interlace in general, they do if f G P^: 

Lemma 9.74. Iff G P^ then f[x, a) <|C f H (x, a) for a sufficiently large. 

Proof. Assume f e P^fn) and let g r be the homogeneous polynomial con- 
sisting of all terms of f of degree r. We can write 

n 

f(x,a) = ^ gi (x,a) (9.10.1) 

i=0 

We know from Lemma [9.731 that gt (x, a) < gt-i (x, a) for 1 ^ i ^ n. We show 
that if a is sufficiently large then f(x, a) sign interlaces f H (x, a) and that the 
sign of f (x, a) on the i-th largest root of f H is (— l) n+ \ 

Iff H (8,l) = then f H (a6, a) = g u (a|3, a) = since f H is homogeneous. 
Upon computing f (a|3, a) we find 

f (a(3, a) = + Bn-ifap, a) + 0(cc n " 2 ) 

Now since g n (x, a) < g n _i (x, a) and the leading coefficients of both g n (x, a) 
and g n _i(x, a) are positive we know that the sign of g n _i(a(3, a) is (— l) n+1 
where |3 is the i-th largest root of f H (x, 1). Since g n _i(ct6, a) is 0(a n_1 ) it 
follows that for a sufficiently large 

sgng n _i(a|3,a) = sguf(a6,oc) = (-l) n+1 

and the conclusion follows. 

□ 



9.11 The representation of P2 by determinants 

The main result of this section is that there are simple determinant represen- 
tations for polynomials in Pj 08 and Pj. Before we can prove that polynomials 
in Pj 05 can be represented by determinants, we need the following result due 
to Vinnikov and Helton, see |791ITT2"llT74l . 

Theorem 9.75. Suppose that f(x,u) is a polynomial with the property that 
f(ax, |3x) e P for all a, 6 € K. If f[0, 0) = 1 then there are symmetric matrices 
B, C such that 

f(x,y) = |I + xB +yC|. 

Theorem 9.76. Iff e F^ s and f (0, 0) = 1 then there are positive definite symmetric 
matrices Di, D2 so that 

f{x,y) =|I+xDi+uD 2 |. 

Proof. Since Proposition |9.72| shows that f (x, y ) satisfies the hypothesis of The- 
orem [9]75l Vinnikov's result shows that there are symmetric matrices B, C so 
that f[x,y) — |I + xB + yC\. The result now follows from Corollary l9.7l □ 
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Corollary 9.77. If f e Pi then there is a symmetric matrix C, a positive definite 
diagonal matrix D, and a constant |3 so that 

f(x,y] = |3|xI + yD + C|. 

Proof. There is an a so that f (x + a,y) G Pf ' so we can find positive definite 
matrices Di,D2 and a constant a = f (a, 0) so that 

f(x+ a,y) = a|I +xDi +yD 2 | 

f(x,y) = a|I + (x-a)Di+yD 2 | 

1 /2 

Since Di is positive definite, it has a positive definite square root Dj 

f(x,y] = a|Di| ■ |D~ 1/2 (I- aD^D" 172 + xl + yD~ 1/2 D 2 D~ 172 | 

i/2 1/2 

If we let A be the symmetric matrix D x (I — aDijDj , B the positive defi- 
nite matrix D 1 D2D 1 , and |3 — a|Di| then 

f(x,y) = (3|xI+yA + B| 

Now let ODO 1 = A where O is orthogonal and D is diagonal 

f(x,y) = p|xH-yD + 0*BO| 

Since 0*80 is symmetric, the corollary is proved. □ 

Remark 9.78. We can use the determinant representation of P2 to show that 
P2 is closed under differentiation without using any geometry. Recall that 
the characteristic polynomials of a symmetric matrix and any of its principle 
submatrices interlace (see Theorem ll.61l l. 

Suppose that f (x,y) S P2(n), represent it by |xl + yD + C| as above, and 
let M = xl + yD + C. Let {di} be the diagonal elements of D. Since the only 
occurrences of x and y in M are on the diagonal, it is easy to see that 

— |xl +yD + C| = |M[1]| + |M[2]| + • • • + |M[n]| 
ox 

Q 

— |xI + yD + C| = di|M[l]| + d 2 |M[2]| + --- + d n |M[n]| 
9y 



If we substitute a for y we see that 

|xl + aD + C| <|M[i]](x,a) 

since principle submatrices interlace. All the polynomials |M[i]| have positive 

homogeneous part, so we simply add the interlacings to conclude that 

f (x, <x) < ?r-f (x, a). It follows that f < and in particular the derivative is in 

P 2 . 
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9.12 When some coefficients are zero 

If f (x) is a polynomial in P, then we know two facts about zero coefficients: 

1. If the constant term is zero then f is divisible by x. 

2. If two consecutive coefficients are zero, then all the coefficients of terms 
of smaller degree are zero. 

We investigate the implications of zero coefficients for polynomials in P2. We 
will show that if a row (or column) has two consecutive zero coefficients then 
there is a large triangular region of zero coefficients. For example, the coeffi- 
cient array of the polynomial f (x,y) = (x + y) 3 (x + y + l) 2 has a triangular 
block of zeros: 



The polynomial 3f/3x has constant term zero, yet is irreducible. 

Lemma 9.79. Ifi — Y. a i,j x1 "y' e p 2 has a row (or column) with two consecutive 
zero coefficients, say a r<s _i = a T/S = 0, then then ay = Ofor alli + j ^ r + s. 

Proof. We use the fact that if g(x) interlaces x k h(x), then g is divisible by x k_1 . 
If we write f = Y. f i Mv 1 then the hypothesis says that f T has two consecutive 
zeros, so f r is divisible by x s . We then know that f r+ i is divisible by x s_1 , f r+ 2 
is divisible by x s ~ 2 and so on. 

Next, write f = Y.^iiv) x ^- The above paragraph shows that a r +i ; k. = 
ir,k = for ^ k ^ s — 1, so Fk has two consecutive zeros, and hence is 
divisible by x r . Continuing, F^+i is divisible by x T_1 , and so on. Thus, we've 
found that all ay are zero if i + j < r + s. □ 

If we consider polynomials in that have a quadrilateral where the 
strict rhombus inequality is not satisfied, then there is also a large triangular 
region with zeros. 

Lemma 9.80. If^ — Y. a y xt y' € ^2 an d there are r, s such that 
a T , s a T +i /S +i — a T+ i /S a T , s +i = then ay = Ofor all i + j < r + s + 1. 

Proof. Write f = Y.^iMv x - The expression a rjS a T+ i, s+ i — a r+ i, s a r , s+ i is the 
determinant of consecutive coefficients of two interlacing polynomials. If it is 
zero, then by Corollary l4~3T1 we have a r s = a r , s +i = 0. The result now follows 
from Lemma 19791 □ 

Corollary 9.81. Ifi — Y_ iy xt y' <= an ^ f (0/0) 7^ then all rhombus inequal- 
ities are strict. 



1 

2 5 



1 8 10 

3 12 

3 





10 

8 5 
12 1 
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As with one variable, we find that agreement at two consecutive coeffi- 
cients implies lots of agreement. 

Corollary 9.82. Suppose that f g in P 2 °\ f = Y a i.,i xi V^> an d 9 — Y byxhj'. 
If there are r, s so that a T/S = b T , s and a T +i /S = b T+ i, s then ay = by for i + j < 
r + s + 1. 

Proof, f — g is in P^ 05 , and has two consecutive zeros. □ 

We can replace the consecutive zero condition with equal minimum de- 
gree. 

Lemma 9.83. Suppose that f = Y fi My 1 € p 2, and let d be the minimum x-degree. 
If two consecutive ft have degree d then so do all earlier ones. 

Proof. We first differentiate d times with respect to f , and so assume that two 
consecutive coefficients are zero. By Lemma 14.101 all earlier coefficients are 
zero for all x, and so are identically zero. □ 

9.13 The Hadamard product 

We can realize the Hadamard product as a coefficient in a polynomial in two 
variables. Begin with two polynomials of the same degree: 

f (x) = ao + aix + ci2X 2 + • • • + a n x n 
g(x) = b + bix + b 2 x 2 + h b n x n 

Now reverse g and homogenize it. 

G(x,y) = b n x n + bu-ix*-^ + b n _ 2 x n -V + • • • + b y n . 

If we multiply and write as a series in y 

f(y)G(x,y) = a b n x n 

+ (aib n x n + aobn-ix n-1 >U + • • • 
+ (aobo + --- + Q n b n x n )y n + ... 
+ a n b n y 2n 

then the Hadamard product f * g is the coefficient of y n . 

Theorem 9.84. The Hadamard product 4 of polynomials defines a bilinear map 
P(n) x P ± [n] — > P(n). If ft < f 2 then f 1 * g < f 2 * g. 

Hi does not define a map P x P > P. For instance { x 2 — 1 ) * ( x 2 — 1 ) = x 2 + 1 is not in P. 
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Proof. Without loss of generality we may we assume that g e pp° s . Since 
g e ppos jj. g reversa j j s j n ppos^ an( j ^ e homogenization G[x,y) is in P2. Since 
f * g is a coefficient of f (y)G(x,y) it is in P. Interlacing follows since for fixed 
g the map f 1— * f * g is a linear map P — ► P. □ 

Actually f (y) is not in P2, but rather in the closure of P2 (see Chapter [Till. 
However, the coefficients of f (y)G(x,y) are in P. 

If we look more closely at the product f (y ) G (x, y ) we see that it equals 

• • • + (f * xg) y 1 ^ 1 + (f * g] y n + -(xf * g) y n+1 + • • • 

x 

Consequently 

Corollary 9.85. Iff, g, e P pos (n) then 

(1) ^*^ 2 ^(^-) ^(f*xgHxf*g) forx^Q 



(2) 



f * g xf * g 
f * xg xf * xg 



^ for x ^ 



1 

(3) f * xg (xf*g)<Cf*g 

x 

Proof. The first part is Newton's inequality Cp. 1106b . The second part follows 
from the first and the identity xf * xg = x(f * g). Since f * g and -j-(xf * g) 
interlace, and -(xf * g) has degree n — 1 we have the inter lacings 

1 

f*xg^Cf*q<c-(xf*g) 

which implies the last part. □ 

If we rewrite the last part in terms of coefficients we get 
Corollary 9.86. I/f = L ^tx 1 , g = £ bix 1 and f, g e P pos then 

^ot(b W -i-b l )x t GP. 

Surprising, we can do better than Theorem l9.84l We use a differential op- 
erator in P2, and then evaluate it at to get a polynomial in P. See Corol- 
lary [9TT00l for an alternate proof. Define 

iii filx 1 i = j 
x 1 * x 3 = < 

1 otherwise 

Notice that f *' g = EXP -1 f * g. In terms of coefficients 

f *' g = a b + llaibix + 2!a 2 b 2 x 2 + • • • + n!a n b Tl x n 
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Theorem 9.87. Iff G P ± [n), g G P[n) then f *' g G P(n). 

Proo/. Again assume that f G P pos . Let G[x,y) be the homogeneous poly- 
nomial in P2 corresponding to g. By Lemma [10.621 we know f(-^-)G G P2, 
so evaluation at x = gives a polynomial in P. Consequently, the map 
f x y 1 — ► (f (y-)G)(0,y) defines a linear transformation P* (u) xP(n) — > P(tl). 

The polynomial f *' g is the reverse of (f(£)G)(0,x). To verify this, we 
check it for f = x T and g — x s : 



□ 



ax/ ^ s )(o,x) = 



Corollary 9.88. Iff eP* (n), gi < g 2 G P(n) tTien 

f *' gi<f *' g 2 

The last lemma showed that P ± (n) * P(n) C EXP(P). If we allow analytic 
functions then we get equality. 

Corollary 9.89. P^°~ s * P = EXP(P) 

Proo/. By taking limits we know that P pos * P c EXP(P). Since EXP(f] = e x * f 
we see the the containment is an equality. □ 

There is a partial converse to Theorem l9.87l 

Proposition 9.90. Suppose that f zs a polynomial. If for all g G P v ° s we have that 
f * g G P, f/zen EXPf G P. if/or all g £ P we have that f*geP, tTien EXPf G P ± . 

Proof. Since (1 + x/n) n G P pos we know f * (1 + x/n) n G P. Now (1 + x/n) n 
converges uniformly to e x , so f * e x G P. The first part follows since f * e x = 
exp(f ), so it remains to show that f G P* in the second case. 

If f = £| CL i xl then f * x x (x 2 — 1) = x^a^x 2 — at) so it follows that cu 
and cii + 2 have the same sign. If 0.1+2 is non-zero, and ai + i = then consider 
f * x x (x + l) 2 — x l {a.i + 2X 2 + at). Since cu+2 and at have the same sign, it 
follows that en = 0. If we consider f * x x ~ j+2 (x +1)' we see that all coefficients 
cu+1, en, at-i, . . . are zero. Thus, f = x x f 1 where f 1 has all non-zero terms. 
Since the signs of the coefficients of f 1 are either the same or are alternating, 
we get that fi G P ± . □ 

As an application of Theorem 19 . 871 consider 

Lemma 9.91. J/f G P then (1 - x)~ n * f is in P for n = 0, 1, 2, . . . . 

Proof. From H7.10.61 we know that EXP (1 - x)~ n is in P~p°* since e x G P^ 
and L n (-x) G P pos . The conclusion follows from CorollaryHjlSl □ 
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The lemma explains the curious fact that f i— > (l + x + x 2 + ...)*f sends P 
to itself, yet (1 + x + x 2 + . . . ) = 1/(1 - x) is not in P. 

If we re-express the lemma in terms of coefficients we find that the map 

x 1 h-> Mix 1 (9.13.1) 

which is just Theorem l7.ll 

9.14 Some differential operators 

We use the fact that polynomials in P factor into linear factors to get differen- 
tial operators on P2. We would like to know that if f G P2 then the differential 
operator f ( ) maps P2 to itself, but the best we are able to prove is when 
f is a product of linear factors - see Lemma llO.621 

Lemma 9.92. J/f e P, g £ P 2 then f(^)g e P 2 . 

Proof. It suffices to show that g + a|£ is in P 2 , but this is immediate from 
9<§f- <X □ 

Here is another differential operator that acts on P2. 

Corollary 9.93. Iff £ P pos and g £ P 2 then f (—57^) g £ P 2 - 

Proof. If suffices to show that h(x,y) = — eTFy ) 9 * s m ^ 2 ^ or positive a. 

Since homogeneity is certainly satisfied since h H = ag H it suffices to check 
substitution. If a £ R then 

g |x, a] = aglx, a] - — I ^-g ) (x, a) 



. 9y / 3x \ 9y ' 

and the conclusion follows from g(x, a) < |^ (x, a) and Corollary 12. 101 □ 
Example 9.94. If we take f = (x - l) m and g = (x + y + l) u then 

(ess - J ) " {x + » + 1|n - | ( - 1)m "' (?) & |x + B + ir 



2i<n 



1 



,-n.-2i 



and so we find that 



y _ t (x+y + ly- - 

1 J i!(m-i)!(n-2i)' ' 



n-2i 

.— ITS 



2i<n 
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The differential operator g(x^j) maps P 2 to itself. 
Lemma 9.95. Ifg G P pos , f G P 2 tfzen g(x^)f(x,y) € P 2 . 

Proof. Since all coefficients of g are positive the homogeneous part of 
g(x^j) f(x,y) has all positive coefficients. To show that g(a^-) f (a,y) e P it 

suffices to note that (b + oc^) f(a,y) G P. □ 
Example 9.96. If we take g = (x + l) m and f = (x + y + l) u then 

m / 



and so 



i=0 



Y_ f^jxMx + y + l) 



i=0 



in), e P 2 



£ I : jxMx + y + l) 



Corollary 9.97. If f G P(n), g G P pos (n), g = £ b t x 1 ten 

H(x,y) = ^biX i f (i) (y) GP 2 

Proof. The homogeneous part of h is 

a n ^b i x 1 (n) i y n - i 

where a n is the leading coefficient of f . Since h H (x, 1 ) = a n n! EXP ( g ) it follows 
that h H is in P. Since all coefficients b^ are the same sign h H (x, 1) is actually in 
ppos yj e now apply Theorem l9.87l for fixed y, which shows that substitution 
holds. 

□ 

Corollary 9.98 (Hermite-Poulin). iff, g are given in Corollary \9.97\ then 

^bif (i) (y) G P pos 
Corollary 9.99 (Schur-Polya). Iff, g are given in Corollary \9.97\ then 

^biX l f w (x) G P 
Here is a proof of a special case of Theorem l9.87l 
Corollary 9.100 (Schur). Iff, g G P pos , f = Y. a i xi g = L b t xi > then 

Y_U a i l> i x i G P 
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Proof. Let G(x,y) be the homogenization of g. Since g e P pos we know that 
G(x,y) e P 2 . Since f (i) (0) = i!a t it follows that 

^i! QibiX 1 = f(x^) G(x,y)| y= o 

and hence is in P. □ 

It is possible for two different linear transformations P — ► P to agree on 
all polynomials up to degree n. For instance, suppose that f has degree n + 1 
and all distinct roots. Choose g € P such that f — g = ax n+1 for some non-zero 
a. If we define 

S(h)=f(D)h and T(h) = g(D)h 

then (T — S)(h) = aD n+1 h, and this is zero if the degree of h is at most n. 
Differential operators satisfy a uniqueness condition. 

Lemma 9.101. Suppose that T(g) = f (D)g where f e P has degree n. IfS is a linear 
transformation P — ► P such that 

Sfx^^Ttx 1 ) forO i < n + 2 

then S = T. 

Proof. Since (x — t) n+3 < (x — t) n+2 for any real t we have 

S(x - t) n+3 ~ S(x - t) n+2 = T(x - t) n+2 

f has degree n so T(x — t) n+2 has a factor of (x — t) 2 and therefore S(x — t) n+3 
has t as a root. Thus 

n+2 



= S(x-t) n+3 (t) =S(x n+3 )(t) + ^ ( ■ ) (-t) n+3 - l S(x l )(t) 

Tt+2 , _s 

= S(x n+3 )(t) + ^ \ \ ) (-t) n+3_i T(x i )(t) 



= S(x n+3 )(t) + 



T(x-t) n+3 (t) -T(x n+3 )(t) 



Now t is a root of T(x - t) n+3 so we conclude that T(x n+3 ) = S(x n+3 ). □ 

If we take f = 1 we get 

Corollary 9.102. IfT is a linear transformation P — ► P such that T(x x ) = x 1 for 
i = 0, 1, 2 then T is the identity. 
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9.15 Realizing interlacing 

There is a close relationship between interlacing and products: A pair of in- 
terlacing polynomials can be realized as the first two coefficients of a product. 
If the degrees are different the representation is essentially unique. A product 
is also determined by its first term and its homogeneous part. 

Lemma 9.103. Assume that fo G P(n) is monic, fi has positive leading coefficient, 
and fo < fi. There exist unique bt, a such that (|9.15.1|l is satisfied, and all a are 
positive. The product is in P2. 

n 

Y\(x + b t + ay) = f (x) + hMy + ■■■ (9.15.1) 

i=l 

Proof. Expanding the product, we find 

TL 

1=1 

rt n. 

Ux)=Y_ c, JJ(x + b t ) 
We can rewrite f 1 in terms of f 

j=i ' 

If we are given fo, f 1 then we find the bi as roots of fo, and the a a s the coeffi- 
cients of fi as in Lemma [1.201 

□ 

Remark 9.104. There is straightforward way to express f < g as a limit of strict 
interfacings. We can find non-negative cu such that 

n(x+cuy +b t ) =f + vg + --- 

It's enough to perturb the constants. For positive e and distinct Ci define 

Y[{x+(a i + e)y + {b i + e i )) =f e +ijg e + ... 

We claim that have that 

1. f e < g e for all positive e and sufficiently small 

2. lim f e = f 



CHAPTER 9. POLYNOMIALS IN TWO VARIABLES 



292 



We only need to check the first claim. Since the roots of f e are — (bt + et) and 
the et are distinct, we see that if the £i are small enough then all roots of f e 
are distinct. Since ai + e is positive it follows that f e < g e . 

We can also specify the first term and the homogeneous part. 

Lemma 9.105. If In € P v ° s and f o € P are monic of the same degree then there is a 
product 

gfr/U) = n(cLiy + x + bt) = f (x) + fi(x)-y H 

i 

suc/j fftaf g G P2 flwd 9 H (x, 1) = h(x). 

Proof. If we write h(x) = J^[(x+ai) and fo(x) = J^[(x+bi) then any ordering of 
the en's and bi's gives a product. Since all at have the same sign, the product 
is in P 2 . □ 

As opposed to the unique representation of f < g as a product, we see from 
Lemma|9ll06]that f <C g are the first two coefficients of many products. 

Lemma 9.106. Given f 1 -C fo there exist bt, Ct, di such that J9.15.2l l is satisfied, and 
the determinants | ^ ^* | ore negative for all i. ITze product is in the closure of Pj_. 

n 

F(x,u) = ]^[(x + bt + Ci u + dtxu) = f (x) + fi(x)u + • • • (9.15.2) 

1=1 

Proof. Define at, Pi, (3 by 

n 

f (x) =n( x +^ 

i=l 

n x / a 

fi(x) = pf (x) + J2p t - 



1=1 x+ai 



Choose di so that Y.^=i di = P- We see that fo, f 1 are the leading terms of the 
product 

n 

]^J(x+ on + dixy + (pi + aidi)y) 

i=l 

since the first term is obviously f and the second term is 

y_ — —(dix+Pi + otidi) = ( y di)f + r pi^ 1 - 

z — x + ^ y-^— y ^j- x + ai 

Identify bi = ai, Ci = Pi + aidi and note that the determinant is equal to Pi 
which is negative since fj -C fo- See see Lemma ll 1 .281 and Corollary II 1 .71 for 
similar results. □ 
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Another consequence of looking at coefficients of products is the following 
lemma, which gives a "canonical" polynomial that interlaces two polynomi- 
als. 

Lemma 9.107. Iff = J~[(x+bi) and ct, di are non-negative, then we have a diagram 
of interlacing polynomials 

f " : Z c i ~~TTT 

A 



Z Cid > 



f 



x + bOtx + b,-: 



Proof. Lemma 110,131 shows that Theorem 19.1131 holds for arbitrary products, 
so the interlacings follow from the fact that the four terms in the diagram are 
the coefficients of 1, y, z, yz in the product 

^[(x + CiU + diZ + bt). 



□ 

Another interpretation of this lemma is that if we are given f, g, h with 
positive leading coefficients, and f < g, h. then there is a canonical choice of k 
9 

such that f k where all interlacings are < . In the case that q = h. = f ' 

H 

the canonical polynomial is 4-. Since these polynomials are the coefficients 
of a polynomial in P3 we will see that their determinant is negative. We can 
verify this directly: 



f 



x + b 



^ Ci xTbT 



f2 (z 



x + bi)(x + bj 
ctdj 



Mi 



x + bi)(x + bj] 



z 



cidj 



x + bt)(x + bj; 



=- f \L 



cidi 



- (x + bt) 2 



< 
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9.16 Karlin's conjecture 

We show that certain matrices formed from the coefficients of a polynomial in 
P2° s are totally positive for positive values of x. A special case of this yields a 
solution to a version of Karlin's problem [41,47,100]. 

A Hankel matrix has equal entries on the minor diagonals. Given a se- 
quence a = [oil, ot2, ...) and a positive integer d we can form the Hankel ma- 
trix H(a;d): 



/ OCi 


0i2 


0L 3 




oc 2 




OCi 


■ • O-d+1 


<*3 


«4 


oc 5 


■ • O-d+2 





O-d-1 a d-2 • 


■ 0C\\ 


Oid+1 


OCd OCd-1 ■ 


. a 2 


(X-d+2 


a d+ i ocd 


• a 3 



H(a;d) = 



\0C d tXd+l OCd+2 • • • 0C 2 d J 

We are also interested in the Toeplitz matrix T(a; d) associated to H(a; d). 
If we reverse all the columns of H(a; d) we get T(a; d): 



T(a;d) = 



\ a 2 d a2d-i oc 2d -2 •■• ocd) 

We can form T(a; d) from H(a; d) by shifting the rightmost column d — 1 
places, the previous column d — 2 places, and so on. Consequently, we note 
the useful relation 

det(T(a;d)) = (-l)^)det(H(a;d)) 

The matrix cp(f ) is a Toeplitz matrix. The first result picks out a submatrix 
of cp(f). 

Lemma 9.108. Suppose that f = ao + aix + a 2 x 2 H e P pos . For any positive 

integer the matrix T( (ao, a.\, . . . ); d) is totally positive. 

Proof. The matrix T( (ao, a.\, . . . ); d), with rows and columns reversed, is a sub- 
matrix of cp(f). □ 

We have an important consequence of this simple observation. 

Theorem 9.109. Suppose that f(x,y) = Y. ftM^ £ P 2 "' s . The Toeplitz matrix 
T( (fo(x),fi(x),f2(x), . . .) ;d) is totally positive for all non-negative x. In addition 

(-1)(") det H( (f (x),f 1 (x),f 2 (x), . . . ) ; d) > Ofor x > 
Corollary 9.110. Iff G P pos then 

(1) T( (f (x), ^jfi, -^r^, ...)); d) is totally positive for positive x. 
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(2) (-l)(f)detH((f(x),q^,^l,^,...);d) ^Oforx^O. 

Proof. Use the Taylor series for f (x + y ) and apply Theorem l9.109l □ 

We can restate the last part by saying that the sign of the determinant of 
the matrix whose ij entry is t^ttj is (— 1)^ whenever f e P pos ,andx ^ 0. 
Karlin's conjecture [41 1 is that 

detH( (f(x),f'(x),f (2) (x),f t3) (x),...) );d) > Oforx ^ (9.16.1) 

It is easy to construct counterexamples to this. 5 Karlin's conjecture is true in 
special cases. Here's an example from [47). 

Corollary 9.111. Define EXP" 1 (xV ) = j!x V,gnd le t f s P pos . If 
EXP^ 1 (f(x + y)) S P2 then Karlin's conjecture (|9. 16.11 1 /toZds. 

We can also apply Theorem |4.23| to determinants of translates of a function. 
Suppose A(x) = x + 1 and let T(u Tt ) = [y) . Recall that 

i 

We want to determine the action of e Ay on polynomials: 

e A «x n = Y ^-(x + i) n 
* — 1! 

r,i ^ ' 

= e^T^tx + yr 
where T~ : is the induced map. By linearity 

e A ^f(x) = e^T7 x f(x + y) 

_ ^ A fc f(x) y 
t— k! y 

k=0 



5 f = (x + l)(x + 2)(x + 3) and d = 3 is one. 
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Now we know that T 1 : P pos — ► pp° s so f or aiw positive value a of x 
[T- x f(x + y)] (a,y) =T- 1 f(a + u) e P pos 

since f(a + y) S P pos and consequently e^T" 1 f(x + y) s P pos for any positive 
x. We can now apply Theorem 14. 23 1 to conclude 

Corollary 9.112. Iff e P pos then T[[f, Af, A 2 f/2!, A 3 f/3!, . . . ); d) is totally posi- 
tive for all x ^ 0. 



9.17 Determinants of coefficients 



In Lemma 14.341 we saw that the signs of certain two by two determinants of 
interlacing polynomials were always positive. Since any interlacing f < g can 
be viewed as the first two coefficients of a polynomial in P2 by Lemma 19.1031 
we can generalize these results to determinants of coefficients of a polynomial 
inP 2 . 

So, let's begin by considering the coefficients of xH)' in a product. 



Theorem 9.113. Suppose f e P2, f (x, 0) has all distinct roots, and set 
f(x,y) =£dijxV =Hfi(x)y i 



(9.17.1) 



For 1 ^ i, j ^nw /icroe 



rows 



(columns) 



(squares) 



dij d i+ i,j 

di+ij di + 2j 

dij di,j+i 

di,j+l dy+2 

dij d i+ ij 



<0 



< 



< 



dij+i dt+ij+i 

If f/ze roofs o/f(x,0) aren't all distinct then all < signs are replaced with ^. 

Proof. We know that fo < fi < • • • . The (rows) inequalities are Newton's in- 
equalities since all fi G P. The (columns) results follow upon writing the 
sum as Y_ 9i(y )x x . These inequalities are ^ since it necessarily true that f (0,y ) 
has all distinct roots - consider (x + y + 1) (x + 2y + 1). The (squares) are 
negative since consecutive fi's interlace (Corollary 14.311 1. □ 



Remark 9.114. We can also derive this from Lemma [9. 191 in the same way that 
Newton's inequalities are derived from properties of the quadratic. 

There are polynomials in P2 for which all terms of odd degree are zero, For 
example, consider the coefficient array of ((x + y ) 2 — l) 3 . 
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-1 




3 




-3 . 1 




6 




-12 


. 6 


3 


-12 


-18 


20 


15 


-3 


6 


15 






1 











In such a case we always have alternating signs. 

Lemma 9.115. If f (x,y) e P2 /zas t/ze property that all the coefficients of odd degree 
are zero then the signs of the non-zero coefficients alternate. 

Proof. If f [x,y) — Y_ fiMy v then Lemma |231 implies that the signs of the non- 
zero coefficients of each fi alternate. Writing f(x,y) — Y_ 9i{y) xX shows that 
the signs of each Qi alternate. Thus all the signs in the even rows and columns 
alternate, as do the signs in the odd rows and columns. 

If f = a ij xX V } then the quadralateral inequality applied to the coeffi- 
cients {q o, aio, a i, an} yie;ds a oau - Q10Q01 < 0. Since a i = a i0 = we 
conclude that aoo and an have opposite signs. Thus, all non-zero signs alter- 
nate. 

□ 

Example 9.116. When f € P2 is given by a determinant then the positivity of 
certain determinants in Theorem l9.113l is a consequence of a classical determi- 
nantal identity. Suppose f = |xD + uE + C where D, E are diagonal matrices 
with positive entries, and C is symmetric. Assume the coefficients are given 
the expansion in (9J71). Let A = det(C), Ai = det(C[i]) ; and Ay =det(C[i,j]) 
where C[i] is C with the i-th row and column deleted. Then 

d 00 = A 
d i = Y_ Q vAi 
dio = Y_ eiAi 
dn = Y_ diejAij 

d:d;:|=Z AA ^ d ^-Z A ^ A i d ^ 

= L|A ) ^ t ) |cid j -^A?d i e i <0 



since 



A A t 
A, A„ 



is negative by Sylvester's identity for the minors of the adjugate. 
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Suppose that f = ^ ayxhj' is in Pi^tn), and has all positive coefficients. 
We are interested in the matrices 



/a 



n.,0 



ao,o 



and 



/ a 0,i 



\ Q 0,0 







Q n,oy 



(9.17.2) 



The determinant is Yl a i,n-i which is positive since all the coefficients of 
f are positive. In addition, Theorem 19.1131 implies that all 2 by 2 subma trices 
I ar+ts+i ars'+i I are positive. This is not terribly exciting, but there is a more 
interesting property of matrices to consider: 

Definition 9.117. A polynomial f (x,y) has totally positive coefficients if either 
matrix in J9.17.21l is totally positive. Note that if one of the matrices is totally 
positive, so is the other. 

Example 9.118. Consider a concrete example: choose 

f=(l+x + y) (2 + x + y) [l+x + 2y). 

If we expand f we get 

2 + 5x + 4x 2 +x 3 + 7y + 11 xy +4x 2 u + 7y z + 5xy z + 2y 3 

and the corresponding matrix is 

(2 7 7 2\ 
5 11 5 
4 4 
\0 1/ 

The four by four determinant is 40. The matrices of the determinants of all 
three by three and all two by two submatrices are non-negative. 

We are interested in polynomials in P^ with totally positive coefficients. 
There is a special case where we can explicitly compute some of these deter- 
minants: f = (x + y + l) n . 



Lemma 9.119. For d 



0,...,n 



det 



\L n c 



n 

i=0 



Proof. 6 The i, j entry of the matrix is mztth! (Tt-i+j-d)! 1 K D is the determi- 
nant, then we can factor n! from all the entries, and ( d— j ) ! from all the columns 
so that 



D = 



i!) d+1 
0!---d! 



det 



i!(n-i + j-d}! 



6 Thanks to Ira Gessel for this argument. 
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If we multiply each column by l/(u — i + ) — d) ! then 

(n!) d+1 , . /Yn-d + j 



D = 



(0!---d!)((n-d)!---(n) 



det 



The last determinant has entries that are polynomials in n. If we set pi. — 
( n T d ) then the matrix is (pi(n + }))■ Using a generalization of the Vander- 
monde determinant 11081 we can evaluate this determinant, and the result is 
exactly the desired formula. □ 

Lemma 9.120. Iff has totally positive coefficients then l£ has totally positive coeffi- 
cients. If a is positive then (x + a)f has totally positive coefficients. 

Proof. The argument is best described by an example, so assume that f has 
total degree 3. The determinants of the submatrices of the derivative matrix 
are just constant multiples of the determinants of the original matrix: 




Next, assume that f e Pj has totally positive coefficients, and let a be posi- 
tive. The coefficient matrices of f and (x + <x)f are 



/ a 3,0 




V o 



/ £13,0 




V o 

a 2/0 + oca 3t0 
a 2 ,i 





Q2,0 
Q2,l 






H,o 
Q i,i 
ai,2 




ai,o + aa 2 ,o 
a u + aa 2 ,i 

ai,2 




Q o,o\ 
Qo,i 

0-0,2 
a 0,3/ 

a ao + aai,o 
a 0/1 + ota u 
a 0/2 + aai, 2 
a ,3 



aa 1/0 
aa , 2 



We can write the coefficient matrix of (x + a)f as a product 



/ a 3,0 




V o 



i 2 ,o 
Q2,l 







ai,o 

ai,2 




ao,o 
ao,i 
ao,2 
ao,3 






0/ 



/I 





Vo 



a 
1 







a 
1 







a 
1 




/ 



The first matrix of the product is totally positive by assumption, and the sec- 
ond is easily seen to be totally positive. Since the product of totally positive 
matrices is totally positive [100] it follows that the coefficient matrix of (x + ct)f 
is totally positive. □ 
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Corollary 9.121. Iff, g 6 P pos /tape positive leading coefficients then f (x)g(u ) has 
totally positive coefficients. 

Proof. First of all, g(y) has totally positive coefficients since the matrix of 

/b n ... bi b \ 
... 

V ... 0/ 

If we write f (x) = a(x + aj) . . . (x + a n ) then we can inductively apply the 
lemma. □ 

We conjecture (Question 1 1021 that all polynomials that are product of lin- 
ear terms in P^ 08 have totally positive coefficients. It is not the case that all 
polynomials in are totally positive. Consider the example where 

/l 0\ /13 9 7\ (5 7 8\ 
M= 1 +x 9 7 5 +y 7 11 12 
\0 1/ \7 5 4/ \8 12 14/ 

The determinant of M is in P^ 08 and equals 

1 + 24 x+ 16x 2 + 2x 3 + 30u + 164xu + 62x 2 y + 22 y 2 + 64 xy 2 + 4y 3 

with coefficient matrix 

/4 0\ 

22 64 

30 164 62 
\1 24 16 2/ 

The determinant of the three by three matrix in the lower left corner is 
—1760. Of course, all the two by two subma trices have positive determinant. 

We can't show that f e P^ 08 has totally positive coefficients, but we can 
show that the matrix of coefficients is TP2. This is a simple consequence of the 
rhombus inequalities, and reflects the geometric fact that the the graph of the 
coefficients is log-concave. 

Lemma 9.122. Iffe F p 2 ° s then the coefficient array H9.17.21 is TP 2 . 

Proof. We describe a special case; the general case is no different. Suppose 
f = Y. ly* 1 !/'/ an d we wish to show that | UjJJ ^ | is non-negative. Consider 
a portion of the coefficient array 
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0-0,2 0Ll,2 &2,2 




ao,i ai,i a 2 ,i 




a 0,0 a l,0 0-2,0 

From the rhombus inequalities we have 

Oo,i a-1,0 ^ o ,o cli,i 
ai,i a 2 ,o > ai, a 2 ,i 
ao,2 ai,i ^ ao,i a u 
ai, 2 a 2 ,i ^ Qi,i a 2j2 

Multiplying these inequalities and canceling yields the desired result: 

O0,2 O 2/ ^ Oo,0 Q 2 ,2 

In general, the various vertices contribute as follows 

corner The corner vertices each occur once, on the correct sides of the inequal- 

ity 

edge The vertices on the interior of the boundary edges each occur twice, 
once on each side of the inequality. 

interior The interior vertices occur twice on each side of the inequality. 

Again, multiplying and canceling finishes the proof. □ 

9.18 Generic polynomials and 

In one variable the interior of the space of polynomials consists of polynomials 
whose roots are all distinct. The analog in P 2 is polynomials whose substitu- 
tions have all distinct roots. We also have the analog of P sep , the set of all 
polynomials whose roots are at least one apart from one another. 

Definition 9.123. Pf en consists of all polynomials f (x,y) in P 2 such that 

1. f (x, a) has all distinct roots, for all a e R. 

2. f(a,x) has all distinct roots, for all aeK, 

If we define A x f (x,y) = f (x + l,y) and A y f (x,y) = f(x,y + 1) then P^ 
consists of all those polynomials in P 2 satisfying 

• f <A x f 
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For any fixed awe see that= both f (x, a) and f (<x,x) are in P sep , so f G Pf en . 

The graph of a polynomial in P| en (n) is easy to describe: it's a set of n 
non-intersecting curves. If f is in then all the curves are at least one apart 
on every vertical line, and they are also at least one apart on every horizontal 
line. In addition, 

Lemma 9.124. Iff e Pf™ then the solution curves are always decreasing. 

Proof. If the graph of f (x,y) = had either horizontal or vertical points, then 
by Lemma [9.301 the graph would have double points. However, the solution 
curves are disjoint since f G Pf en . We conclude the the slope is always the 
same sign. Since the solution curves are asymptotic to lines with negative 
slope, the solution curves are everywhere decreasing. □ 

Example 9.125. We can use polynomials in P sep to construct polynomials in 
P^. If f (x) € P sep , then f (x + y) is easily seen to be in Ff p . The graph of 
f (x + y ) is just a collection of parallel lines of slope 1 that are at least one apart 
in the x and y directions. 

We can easily determine the minimum separation of the roots when the 
polynomial is a quadratic given by a determinant. We start with one variable. 
Define 



f(x) = 



aix + a y 
y a 2 x + (3 



(9.18.1) 



The square of the distance between the roots of a polynomial ax 2 + bx + c 
is (b 2 — 4ac)/a 2 . The squared difference between the roots of f is 

1 \ 

r ((a + y) 2 -4Q 1 Q 2 (a|3 -y 2 )) = ^ T (4aia 2 y 2 + (aa 2 - |3ai) 2 ) 



n 9 V v ' X I iujn Zl ^p r II— 2 2' 

a^CLj aja 2 



and hence the minimum squared distance is 4y 2 /(aia 2 ). To find the minimum 
of 

aix + biy + a y 

y a 2 x + b 2 y + |3 



f(*,y) = 



(9.18.2) 



we note that if y is fixed then by the above the minimum distance is 
2|y|/v /Q i a 2- h x is fixed then the minimum distance is 2|y|/v / bib 2 . It follows 
that 

Lemma 9.126. The polynomial f (x,y ) of (19.18.21 is in P 2 cp iff 
|y| ^ - max ^a 1 a 2 , V^ib^j 
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Remark 9.127. The condition that the solution curves of a polynomial in P^ ep 
are at least one apart in the horizontal and vertical directions does not imply 
that they are one apart in all directions. From the above we know that f = 
(x + y + 1/2) (x + y — 1/2) 6 . The line x = y meets the graph, which is two 
parallel lines, in the points ±(1/4, 1/4). Thus, there are two points on different 
solution curves that are only s/2/2 = .707 apart. 

Example 9.128. The polynomial xy — 1 is a limit of polynomials in Pj • We 
know that the limit 



lim 

e->0+ 



+ ey 1 

1 £X+t) 



xy -1 



expresses xy — 1 as a limit of polynomials in P2. In order to apply the lemma 

e" 
x+ey 



we need to multiply each side by I i —1/2 L which yields 



lim e 

e^0 + 



x+e 



xy - 1 



Since 2\cu\ — 2|ci2|/\/did2 = 2e 1/2 we see that the roots are at least one apart 
when < e < 1/4. 

Polynomials in P^ ep satisfy the expected properties. 

Lemma 9.129. 

1. Pf™ and Y^" are closed under differentiation. 

2. The coefficients of any power ofy (or ofx) of a polynomial in P^ are in P sev . 

Proof. The statement follows since differentiation preserves the properties of 
all distinct roots, and of interlacing. If we differentiate sufficiently many times, 
and substitute zero we can conclude that all coefficients are in P sep . □ 

We can use properties of symmetric matrices to construct polynomials in 
Pf n . We first show El page 124] 

Lemma 9.130. If Mis a symmetric matrix whose superdiagonal entries are all non- 
zero, then the eigenvalues of M are distinct. 

Proof. If we delete the first column and the last row of <xl + M, then the re- 
sulting matrix has all non-zero diagonal entries, since these entries come from 
the superdiagonal of M. Since the product of the diagonal entries is the prod- 
uct of the eigenvalues, the matrix is non-singular. Thus, if M is n by n, then 
al + M contains a submatrix of rank n — 1 for any choice of a. If M had a 
repeated eigenvalue, then there would be an a for which al + M had rank at 
most n — 2. Since al + M has rank at least n — 1, this contradiction establishes 
the lemma. □ 

Corollary 9.131. Suppose C is a symmetric matrix whose superdiagonal is all non- 
zero, and D z's a diagonal matrix of positive diagonal entries. Then, 

IxI+yD + C| S if" 

Moreover, xl +yD + <xC| G P^ for sufficiently large a. 
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Proof. Let f (x,y) = |xl + yD + C|. The roots of f (x, a) are the eigenvalues of 
ocD + C, and they are all distinct by the lemma. If f (a.,y ) —0 then 



ccD" 1 + yl + vT]RC\/EF T 







Thus, the roots of f (a,y) are the eigenvalues of <xD 1 + VD 1 CVTD l . Since 
VD _1 CVD _1 satisfies the conditions of the lemma, the roots of f (ct,y) are all 
distinct. Thus, f(x,y) <E Pf". 

The roots of |xl + yD + <xC| are the roots of f (5/ ~L an d s ° the second part 
follows. □ 

A similar argument shows the following. 

Corollary 9.132. If Di is a diagonal matrix with distinct positive entries, and D2 is 
a positive definite matrix whose superdiagonal is all non-zero, then 

H+xD 1 +uD 2 |eif"nP?r. 

Corollary 9.133. Every polynomial in P 2 is the limit of polynomials in Ff n . 

Proof. By Corollary 19.771 if f(x,y) e P2 then there is a diagonal matrix D and 
a symmetric matrix C such that f (x,y ) = |xl + yD + C|. Let U e be the matrix 
that is all zero except for e's in the sub and super diagonal. The polynomial 
|xl + yD + C + U £ | is in P| en for |e| sufficiently small, and it converges to f (x,y ) 
as e — » 0. 

□ 

Example 9.134. Here is an example of a cubic polynomial in P^. Let 



f(x,y) = 



x + y 1 
1 x + 2y 1 
1 x + 3y 



= -2x + x 3 -4y+6x 2 u + 11 xy 2 + 6y 3 



The roots of f (x, ct) are 



—2 oc, —2 a — \/l + a 2 , — 2 a + \/l + a 2 

and the minimum distance between these roots is clearly \fl. If f(a,y) = 
then the roots are 



-a -2 oc - V6 + a 2 -2a+V6~ 
~2~' 3 ' 3 

and the minimum is seen to be 1/V2- Consequently, replacing y by y /V2, x 
by \/2x, and multiplying by \/2 yields 

-4x + 4x 3 - 4y + 12x 2 y + llxy 2 + 3y 3 € ?f p 
and is a polynomial satisfying 6 X = S y — 1. 
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The hyperbola xy — 1 is a good example of a polynomial whose graph has 
no intersections. This can be generalized. 

Lemma 9.135. If D is a positive definite matrix with all distinct roots, and g G P2 
then |D - gl| e Pf". 

Proof. If D is an n by n matrix then the homogeneous part of |D — gl| is g n . 
It suffices to show for each cc that |D — g(x, <x)I| has all distinct roots. There 
are a, b, c such that g (x, a) = a(x + b) (x + c) . After translating and scaling we 
may assume that g[x, oc) — x 2 — 1. Now 

|D-g(x,a)I| = |(D + I)-x 2 I| 

Since D has all distinct eigenvalues, |D + I — xl has all distinct positive roots. 
Taking square roots shows that |D + I — x 2 I| has all distinct roots. □ 

9.19 Integrating generic polynomials 

We can integrate generic polynomials. To do this we first need to introduce 
some quantitative measures of root separation. We extend the one variable 
measure 6(f) of separation to polynomials in P2: 

Mf) = inf 6(f(x,a}) 

ccGR 

6 v (f) = inf 6(f(a,y)) 

aGR 



If the solution curves of f cross, then both 5 y (f) and 6 x (f) are zero. Thus, 
f € Pf en if either 6 X (f ) > or 5 y (f ) > 0. For an example, the polynomial 

f[x,y) = (x+ cry)(x+ ay + b) 

has 6 x (f) = b and 6 y (f) = b/a. This shows that (6 x (f ), S y (f )) can take any 
pair of positive values. 

Applying (|3. 6.11 1 to f € P 2 yields 

s s; t ^ 5 y (f) =^ (Va6l)f(a,u + s)<f(a ( y+t) 

This shows that f e if and only if 5 X (f ) ^ 1 and 5 y (f ) ^ 1. 

Proposition 9.136. Suppose that f e Pf™, < a ^ 6 y (f), and w(x) is a non- 
negative integr able function on R. If we define 



g(x,y) 

then g(x,y) G Pf™, and b y (g) ^ j(x 



f(x,y + t)w(t) dt 
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Proof. Since g H = (J"w(t) dt) f H , we see that g satisfies the homogeneity 
condition. Proposition 13.371 implies that g satisfies substitution, and Proposi- 
tion [3T42] shows that the separation number is as claimed. Since b y (g) > 0, we 
know that g e Pf n . □ 

Remark 9.137. If f e P^ ep then J* f (x,y + t) dt is in Pf en , but even the simplest 
example fails to be in Pj • Consider 

f(x,y) = (x + y)(x + y + 1) 

■l 

f(x,y + 1) dt = g(x,y) = 5/6 + (x + y)(x + y + 2) 

o 

5 x (g)=5 y (g) = V273 <i 

Here's a curious corollary. 

Corollary 9.138. Suppose that f e Ff", and b y (f ) > 1. If ^ h = f, to A y (h) e 
Pf™. " ' 

Proof. Take w = 1, and a = 1. The corollary follows from Proposition [9T36| 
since 

pi 

f(x,y + t)dt = h(x,y + 1) -h(x,y) = A y (h) 

□ 

If f G P, then it follows from the definitions that 

6(f) =6 x f(x + y) =5 y f(x + y) 

Consequently 

Corollary 9.139. Ifi e P, w(t) positive, and < a < 6(f) then 

f(x + y +t)w(t) dt e Pf™. 

9.20 Recursive sequences of polynomials 

In this section we show how to construct polynomials in P2 that are analogs of 
orthogonal polynomials. The next lemma (a special case of Lemma [l0.1121 is 
the key idea, for it allows us to construct a new polynomial in P2 from a pair 
of interlacing ones. 

Lemma 9.140. Ifi<g are both in P2, and a, b, d are positive then 

(ax + by + c)f - dg < f 

Two constructions are immediate consequences of the lemma. The first 
one shows that certain two-dimensional analogs of orthogonal polynomials 
are in P2. 
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Lemma 9.141. Let a n , b n , d n be positive, and define a sequence of polynomials by 

P_i =0 
Po = l 

Pn+i = (a n x + b n y + c n ) P n 



d^P 



n r n— 1 

Then all P n are in P 2 and 

P > Pi >P 2 > • • • 

Note that P n (x, a) and P n (a,y ) are orthogonal polynomials for any choice 
of oc. These polynomials also have a nice representation as determinants that 
shows that they are in P 2 . If we let li — a^x + b|U + Ci then we can construct 
a matrix 



/ h 


v 7 ^ 





.. 





o\ 








.. 















Vdl .. 
















u .. 








V o 








.. 







whose determinant is P n . If we set t\ — Ci then the matrix is called a jacobi 
matrix, and its characteristic polynomial is an orthogonal polynomial in one 
variable. 

Lemma 9.142. Let a n , b n , d n , e n be positive, and define a sequence of polynomials 
by 

P_i =0 
Po = l 



,3 3 , „ 

d n ^- + e n — I P r 

3y 3x, 



Pn+i = (a n x + b n y + c n ) P r 
TTzen all P n are in P 2 and Po>Pi >P 2 2> • • 

9.21 Bezout matrices 



Any two polynomials f, g of the same degree determine a matrix called the 
Bezout matrix. We give a simple argument that the Bezout matrix is positive 
definite if and only if f and g interlace. We also investigate properties of the 
Bezout matrix for polynomials in P 2 . 

Definition 9.143. If f (x), g (x) are polynomials of the same degree n the Bezout 
matrix of f and g is the symmetric matrix B(f, g) = (bij ) such that 



f(x)g(u)-f(u)g(x) 

x-y 



= y_ bijX^V- 1 



CHAPTER 9. POLYNOMIALS IN TWO VARIABLES 



308 



In order to present the key result II 1521 we introduce the Vandermonde 
vector. 

V n (x) = (l,x,...,x n - 1 ) t . 

Theorem 9.144 (Lander). Iff, g are polynomials of degree n with no common zeros, 
and the zeros of f are T\, ... ,r n thenB(f,g) = (V t ) _1 DV _1 where 

V=(V n (T 1 ),...,V n (r n )) 
D = diag(g(T 1 )f / (n) g(r n )f'(r n )) 

Corollary 9.145. If f € P and f , g have no common roots then f and g interlace if 
and only if the Bezout matrix is positive definite. 

Proof. If the Bezout matrix is positive definite then g and f ' have the same 
sign at the roots of f. Since f ' sign interlaces f, so does g. The converse is 
similar. □ 

This is a classical result. More interesting and recent is that using the Be- 
zout matrix we can parametrize the solution curves of certain polynomials in 
P 2 . We paraphrase Theorem 3.1 |152[ 

Theorem 9.146. Three polynomials fo,fi,f2 in one variable determine a map from 
the complex line C to the complex plane C 2 : 



fo(z) f 2 (z) 



(9.21.1) 



.fiW'fi(z), 

The image is a rational curve defined by the polynomial 

A(x,y) = det(B(f ,f 2 ) +xB(f ,fi) +yB(f 2 ,f a )) (9.21.2) 

Choose mutually interlacing polynomials fo, f i, f 2 in P(n). Each of the Be- 
zout matrices B(fo, fi), B(fo,f 2 ), B(fi,f 2 ) is positive definite since each pair 
is interlacing. It follows that A(x,y) in 119.21. 21 belongs to P!? 06 . The solution 
curves are parametrized by 1 19.21. It . 

Consider an example: 



fi = (2 + x)(8 + x)(12 + x) 
f 2 =(3 + x)(9 + x)(13 + x) B(f ,fi 
f 3 =(7 + x)(ll + x)(15 + x) 

90456 19074 963 
B(f ,f 2 )=| 19074 4109 211 ) B(fi,f 2 ) 
963 211 11 



12600 


2922 


159 


2922 


785 


47 


159 


47 




89568 


17292 


804 


17292 


3440 


164 


804 


164 


8 



A{x,y) = (17325x 3 +175200yx 2 +155803x 2 +242112y 2 x+465664yx+191775x 
+ 73728y 3 + 240384y 2 + 223136y + 61425)/17325 
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Figure 9.9: The graph of a parametrized P2 



All solutions to A(x, y ) =0 are of the form 

(t + 2)(t + 8)(t + 12) (t + 7)(t+ 11) (t + 15) 



y 



(t + 3)(t + 9)(t + 13) " (t + 3)(t + 9)(t + 13) 



The graph of A[x,y) is in Figure [9791 The three curves are parametrized as 
follows: 



the bottom curve t ^ (—13, —3) 
middle on left, top on right t e (—13,-9) 
middle on right, top on left t e (—9, —3) 

Now suppose that f<g e P2. The Bezout matrix with respect to x is 

D u \ f(^,y)g(z,y) -f(z,y)g(x,y) 
B x (f,gJ = 

x — z 

Since f (x, a) and g(x, a) interlace for all ait follows that B x (f, g) is a matrix 
polynomial in one variable that is positive definite for all y. In particular, 
v 1 B x (f, g) v > for all non-zero vectors v. However, v t B x (x,y)v is generally 
not a stable polynomial. 

The polynomial that determines the Bezout matrix has all coefficients in P. 



Lemma 9.147. Iff < gin P and 

B(x,y) = 



x-y 



f(x) 
9M 



f(y) 
g(y) 



then all coefficients of powers ofx or ofy are in P. Iff, g e pv° s then all coefficients 
ofB are positive, and all coefficients are in P v ° s . 
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Proof. Write g(x) = Y_ a i^~r where qi ^ 0. Then 

B(x,y) = -±- (f(x) Y_ ai^- f(y) J" Qi-^ 
f(x) f(y) 



(9.21.3) 
(9.21.4) 



Since {f(x)/(x — r-jj} is mutually interlacing the result follows from 
Lemma [9.181 If f, g G P pos then f(x)/(x — r{] has all positive coefficients, 
so B has all positive coefficients. □ 

Remark 9.148. It is not the case that B(x,y) S P2- If it were then B(x,x) is in P, 



but B(x,x) = 



f(x) f'(x) 
g(x) g'(x) 



which is stable by Lemma T21. 771 



Chapter 

10 



Polynomials in several variables 



10.1 The substitution property 

Just as for one variable there is a substitution property that all our polynomials 
will satisfy. 

Definition 10.1. If f is a polynomial in variables X\, . . then f satisfies Xi- 
substitution if for every choice of ai, . . . , aa in M the polynomial 

f(a 1/ ...,a i _ 1 ,x i , a i+ i,...,a d ] 

has all real roots. We say that f satisfies substitution if it satisfies Xt-substitution 
for 1 < i s$ d. 

It is clumsy to write out the variables so explicitly so we introduce some 
notation. Let x = (xi, ...,x&),a.= [a.\,...,a&), and define 

x- = (ai,...,ai_i,Xi, a i+1 ,...,a d ) 

The degree of a monomial xj 1 • • • x^ d is the sum ii + • • • + i d of the degrees. The 
total degree of a polynomial is the largest degree of a monomial. The x—degree 
of a polynomial is the largest exponent of x-j.. An index set I is a sequence 
[\\, \i, . . . ) of non-negative integers. A monomial in x can be written 

x — 'yc-t ^2 " ° ° 

The size of an index set is |I[ = ii + i% H — • . The degree of x 1 is |I|. 

We can write a polynomial f in terms of coefficients of monomials. If x = 
[x\, . . ., Xd) and y = (yi, ■ ■ ■ ,y e ) then we write 

flxjl^^f.fxly 1 (10.1.1) 
i 

We can restate the definition of Xi-substitution: 
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• f (x) satisfies Xi-substitution iff f (x?) G P for all a G M d . 
All the polynomials we consider will belong to this set: 

Subd = {all polynomials in d variables that satisfy substitution} 

In the case d = 1 the set Subi is just P. Consider these examples in several 
variables. 

Example 10.2. All products of linear factors such as 

(xi +2x 2 -x 3 + l)(xi -x 2 )(x 2 + x 3 ) 

satisfy substitution. More generally, if the factors of a product are linear in 
each variable separately then the product satisfies substitution. An example 
of such a polynomial is 

(xix 2 + xi + 3)(xix 2 x 3 + 2xix 4 + 1) 

Lemma 10.3. Iff is a polynomial, and there are f j G Subd sucn that lim f j = f then 
f satisfies substitution. 

Proof. Observe that for any a G R d and 1 ^ i ^ d we have 

f(x?) = limfjfx?) 

) — >oo 

The result now follows from the corresponding result (Lemma 11.401 for one 
variable. □ 

Substitution satisfies a factorization property. 
Lemma 10.4. If f g G Sub d then f, g G Sub d . 

Proof. The result is obvious for polynomials in one variable. The result follows 
from the fact that (fg)(x?) = f(x?) g(x?). □ 

The reversal transformation is one of the few operations that preserve the 
property of substitution. 

Proposition 10.5. If f has Xi-degree ki and f = Y. a i %1 then the reverse of f is 
defined to be 

rev(f) = Y_ a/**" 1 
where K — (ki, . . . ). If f G Subd then rev(f) G Subd- 
Proof. Let a = (ai, . . . , a&). If we substitute we see that 

rev(f](x?) = Y_ Q i a ^ 1_il ' ■• x i i_ii ' ' ' a d d_id 

= af 1 -..a^^a I a^ 1 • • • • • • 

= a] Cl ...a^rev(f(x^) 

where b = (1/ai, ...,1/dd). Since reversal preserves roots in one variable, 
rev(f) satisfies substitution. □ 
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Interlacing can be easily defined for polynomials satisfying substitution. 

Definition 10.6. If f and g are polynomials that satisfy substitution, then we 
say that f and g interlace if for every real a the polynomial f + ag satisfies 
substitution. 

The relation of interlacing is preserved under limits. 

Lemma 10.7. Suppose ft, gt are in Sub,}, and limi—oofi = f, limi_>oo gi = g 
where f, g are polynomials. If f\ and gt interlace for all i then f and g interlace. 

Proof. Since fi + agi is in Subd for all i, and fi + ag\ converges to f + ag it 
follows that f + ag G Subd- Consequently f and g interlace. □ 

10.2 The class P d . 

Before we can define P a we need a few definitions that generalize the defini- 
tions for d = 2. 

Definition 10.8. If f has degree n, then the homogeneous part of f = Y. a i xl is 

f H = Y_ Q i xI (10.2.1) 

|I|=n 

The homogeneous part controls the asymptotic behavior of the graph, just 
as the leading coefficient does in one variable. 

Definition 10.9. A polynomial f (xj, ...,x&) of degree n satisfies the positivity 
condition if all coefficients of monomials of degree at most n are positive. 

We now define 

Definition 10.10. 

P a = all f in d variables such that 

• f satisfies substitution 

• f H satisfies the positivity condition (Definition ll0.9l l. 

Pd is the analog of "polynomials with positive leading sign and all real 
roots". 

We let Pd( n ) denote the subset of Pd that consists of all polynomials of 
degree n. 

The results in Lemma |9 . 1 61 also hold for polynomials in Pd- The hominto- 
geneous part is always in Ud- 

Lemma 10.11. Iff G P d then f H G P d . Iff G Sub d then f H G Sub d . 
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Proof. Since f H = (f H ) H the positivity condition is satisfied. It remains to 
show that if f € Subd then f H satisfies substitution. Assume that the degree 
of f is n, and write 

f= Y c i xI + H c i xI - 

|I|=n |I|<n 

Replace x by x/e and multiply by e n : 

e n f(x/e) = Y_ Cix r + Y_ e n ~ W CiX l . (10.2.2) 

|I|=tv |I|<n 

It follows that e n f (x/ e) converges to f H as e approaches 0. Lemma H0.3l shows 
that f H satisfies substitution. □ 

As is to be expected, substituting for several variables is a good linear 
transformation. 

Lemma 10.12. Let x = (xi, . . . ,x d ), y = (yi,...,y e ) and choose a e R e . The 
substitution map x l yl i— > x 7 aj is a linear transformation P^+e — ► Ua- 

Proof. Using induction it suffices to prove this in the case that e is 1. Let x a = 
(xi, . . .,Xd, a). Choose f e Pa, and let g = f(x a ). Since f satisfies substitution 
so does g. The homogeneous part f(x Q ) H is found by deleting all the terms 
involving x^+i in f H . Since all coefficients of f H are positive, so are those of 
f(x a ) H . it follows that geP d . □ 

Unlike Sub a, not all products of linear terms are in Pa- The lemma below 
shows that the signs must be coherent for this to occur. Note that there are no 
constraints on the signs of the constant terms. 

Lemma 10.13. Suppose 

n 

f(x) = JJ(b k + oikXi + • • • + QdkXd). (10.2.3) 

k=l 

Assume that all coefficients ofx\ are positive. Then f e Pj iff for each 1 ^ i < d all 
coefficients o/xt are positive. 

Proof. The homogeneous part of f is 

TL 

f H M = n(aikXi H V a dk x d ). 

k=l 

If the signs of the coefficients of x^ are positive then the coefficients of 
f H (xi, . . . , xa) are all positive. 
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Conversely, the terms in f H involving only variables xi and Xi are easily 
found. They factor into 

TL 

k=l 

If f H satisfies the positivity property then the coefficients of this polynomial all 
have the same sign. Since the coefficients of xi are all positive, the coefficients 
of xi are positive. This is Lemma I27T1 □ 

Definition 10.14. If f , g e P d then f and g interlace iff f + ag is in ±P d for all 
oc. If in addition the degree of f is one more than the degree of g then we say 
f <_ g. In this case (f + g) H = f H , and so the only condition we need to verify is 
substitution. This leads to an equivalent definition in terms of substitutions: 

If f,g e P d thenf <giff f(x?) <g(x?) for all a e K d and 1 < i sC d. 

The following important result is an immediate consequence of the defini- 
tion. See the proof of Theorem ll.431 

Theorem 10.15. If T: Pd — ► Pd is a linear transformation then T preserves inter- 
lacing. 

Lemma 10.16. Iff, g e P d satisfy f < g then f H < g H . 

Proof. The transformation f f H maps P d to itself. □ 

Both Subd and Pd satisfies a loose form of closure. We will study the 
general question of closure in the next chapter. 

Lemma 10.17. Assume f is a polynomial in d variables that satisfies the homogeneity 
condition. If there are polynomials ft e Pd such that lim f i = f then f is in Pd- 

Proof. Applying Lemma [l0.3l shows that f satisfies substitution. By hypothesis 
f satisfies the positivity condition, and hence f € Pd- d 

Quadratics in 

There is an effective criterion to determine if a quadratic polynomial is in Pd- 
If we write f = Y. a i xI then for all values of x 2 , . ■ ■ , x& the polynomial f must 
have all real roots as a quadratic in Xi. This is true if and only if the dis- 
criminant is non-negative. If we write out the discriminant then we have a 
quadratic form that must be positive semi-definite. 

For simplicity we only consider the case d = 3, but the general case 
presents no difficulties. Suppose that 



f (x,y, z) = qqoo + z a 00 i + z 2 a 002 + y a ow +yz a on + y 2 a 020 + 
x Qioo + x z aioi +xy quo + x 2 a 2 oo 
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then the discriminant is 



aioo 2 + 2 z aioo a 10 i + z 2 ai i 2 + 2y aioo a 110 + 2y z aioi an 
+H 2 Quo 2 — 4 a oo Q200 — 4 z a oi Q200 — 4 z 2 aoo2 CI200 
-4y a io a 2 oo - 4y z a n a 2 oo - 4y 2 a 02 o a 2 oo 



Now we can homogenize this, and consider it as a quadratic q(u,z, w). 
Since the discriminant is non-negative, the quadratic must be non-negative 
for all values of y, z, w. Consequently, the matrix Q of the quadratic form q 



is positive semi-definite. Conversely, if Q is positive semi-definite then 
f(x,t),z) satisfies x-substitution. Note that the diagonal elements of Q are 
positive if and only if f(x,y,0),f(x, 0,z),f(0,y,z) each have all real roots. We 
summarize for d = 3: 

Lemma 10.18. A quadratic polynomial is in P3 if and only if it satisfies the positivity 
conditions and the matrix (|10.2.41 is positive semi-definite. 

We define P^ p to be all those polynomials in P d such that all substitutions 
are in P sep . It is easy to construct quadratic polynomials in P d p . 

Lemma 10.19. The polynomial 



Proof. Notice that Lemma 19.1261 does not depend on constants, so if we sub- 
stitute for all but the i-th variable, the minimum distance between roots is 



10.3 Properties of Pa 

In this section we establish the important fact that P d is closed under differen- 
tiation. An important consequence of this will be results about the coefficients 
of polynomials in P d . 

As in P2 we only need to know that substitution holds in one variable to 
conclude that substitution holds in all variables. This is a very useful result. 




(10.2.4) 



(qjXi H 1- a d x d )(bixi H h b d x d ) - c 2 




2|c|/v / atbi. 



□ 



Theorem 10.20. Suppose that f ;'s a polynomial in d variables, and satisfies (see 
Definition\mM 
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• the positivity condition 

• Xi-substitution for some xt 

then f satisfies substitution, and is in Pa- 
Proof It suffices to show that if f satisfies xi -substitution then it satisfies X2- 
substitution. For any a = (ai, . . ., a^) let g(xi,X2) = f (xi,X2, (13, .. ., a^). Since 
g satisfies xi -substitution and g H is a subset of f H , we can apply Theorem l9.27l 
to conclude that g satisfies X2-substitution. □ 

Theorem 10.21. Suppose that T: Pd — ► Pd is a linear transformation that pre- 
serves degree, and for any monomial x 1 the coefficients of T(x*) H are all positive. If 
y = (y lr . . . ,y e ) then the linear transformation 

T*(xV) ^Tfx^y 7 

defines a linear transformation from Pd+e to itself. 

Proof. The assumptions on the image of a monomial imply that T* (f ) satisfies 
the positivity condition for f e Pd+e- By Theorem 110.201 it suffices to show 
that (f ] satisfies xi -substitution. If we choose bel e then 

(T*f)(x,b)=T(f(x,b)) 

Since f (x,b) S Pd by Lemma ll0.12l we know T(f (x,b)) is in Pd, and so satisfies 
xi -substitution. □ 

An important special case is when d is 1. 

Corollary 10.22. Suppose T: P — > P is a linear transformation that preserves 
degree, and maps polynomials with positive leading coefficient to polynomials with 
positive leading coefficient. We define a linear transformation on Pd by 

T i (xf...x^)=xr...x^T(x^)x^...x^ 

T maps Pd to itself. 

Theorem 10.23. Pd is closed under differentiation. If f e Ph then f <-^- for 
1 < i < d. 

Proof. The linear transformation T of P defined by Tg = g + ag' maps P to 
itself for any a. Applying Theorem ll0.21l shows that f + cigpr is in P^ for all 
a, and the conclusion follows. 

□ 

Corollary 10.24. Ifi — Y_ f/My 1 is in Pd+e and J is any index set then f/(x) is in 
Pd(IJI). 
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Proof. If J = ( ji, . . . , ) e ) then we can apply Theorem 1 1 ,231 since 

1 



fjM 



ji!j2! • • • je! 



y=0 



□ 



In P2 we know that consecutive coefficients interlace. The same is true in 
P a, and the proof is the same. 

Lemma 10.25. Suppose that f (x,y) e Pa+e, and we write 

f (x,y) = • • • + fifx)/ + f/ uyi (xhjjy 1 + • • • 

fe« fj < fiuy r 

Theorem 10.26. 7/f,g,he P d and f <g, f <hthen 

• g + h e P d 

• f <g + h 

• For any f, g e P d then f g e Pd- 

Proof. As we saw for two variables, Lemma 11.101 shows that g + h satisfies 
substitution. Since (g + h) H = g H + h H it follows that g + h satisfies and 
positivity condition. 

The product fg certainly satisfies substitution. Since (fg) H = f H g H we 
see that the positivity condition is satisfied. The last one follows from the 
corresponding one variable result Lemma [1.31| □ 

Two by two determinants of coefficients are non-positive. This is an im- 
mediate consequence of Theorem l9.113l 



Corollary 10.27. Suppose that f (x,y, z) — Y_ hi MyV is in Pa+2- Then 

< 



fi,j (X) f (x) 

fy+iM fi+i,j+i(x) 



If we apply this corollary to the Taylor series for f (xj + y, X2 + z, X3, . . . , x,x) 
we get 



Corollary 10.28. I/f (x) £P d ,d) 2, then 



f 



3x2 3xj 3x2 



0. 



Remark 10.29. If we are given a polynomial f that satisfies the above inequality 
for all pairs of distinct variables, then it does not imply that f G P $ . However, 
this is true for multiaffine polynomials (See p. 13891 ). 
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10.4 The analog of P pos 

P pos is the set of all polynomials in P with all positive coefficients. The analog 
P^ os of P pos is also defined in terms of the signs of the coefficients. P^ 05 is 
closed under interlacing, and the homogeneous part of any polynomial in P <j 
is always in F p ^ s . 

Definition 10.30. ¥ p ^ s (n) is the collection of all polynomials f e P a ( n ) such 
that all coefficients of monomials of degree at most n are positive. 

That is, if f = Y. a i xI has total degree n, then ai is positive for all |I| ^ n. 

We can easily characterize products of linear terms that are in P^. It fol- 
lows from Lemma [10.131 that a product ((10.2.3b is in P p ^ s iff all are positive, 
and all are positive. 

Lemma 10.31. 

1. P^ os is closed under differentiation. 

2. Ifi € P^ os then for any positive awehave'f{a,X2, ...,%d) € P^-i- 

3. Iff € P^+e t nen coefficients f 7 given in ilO.l.H are in P^ os . 

Proof. We only need to check positivity for the first statement, and this is im- 
mediate. The second statement is obvious, but it is false if we substitute a 
negative value. 1 Finally, differentiate sufficiently many times until fi is the 
constant term, and then substitute for y. □ 

We can determine if a polynomial is in Pj° s just by looking at the constant 
terms with respect to each variable. 

Theorem 10.32. Suppose that f € Pa and for each 1 < i ^ d the polynomial 
f(0,...,0,Xi,0,...,0) isinP v ° s . Then, f is in ±F P ° S . 

Proof. We may assume that 

f(0,...,0, Xi ,0,...,0) 

has all positive, or all negative, signs. The proof now proceeds by induction 
on the number of variables. For d = 2 the hypothesis implies that in 

f(xl,X 2 ) =f (Xl)+--- + f n (Xl)x? 

the coefficient fo is in P pos . We may assume that all coefficients of fo are posi- 
tive. Since f o < f i we see that all coefficients of f i have the same sign. Continu- 
ing, all the coefficients of fi have sign e^. Now the hypothesis also implies that 
f (0, X2) is in P pos . Since f (0, X2) contains the constant terms of all ft it follows 
that all £i have the same sign. Thus all coefficients of all the f i are positive. 

1 f = (1 + Xl + x 2 )(3 + X! + x 2 ) is inpP° s ,but f (-2,x 2 ) = x§ - 1 is not in P ± . 
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In general, if f e P d then by induction we know that all substitutions 

f(x l7 ...,Xi_ 1/ 0,x i+ i 7 ... 7 x d ) 

are in P^j. We may assume that f(0,X2, • • - ,x d ) has all positive terms, and 
then use overlapping polynomials as above to conclude that all terms are pos- 
itive. □ 

P^ os is closed under interlacing. 

Theorem 10.33. Iff e P^", 9 e P d and f<g then g is in F p ° s . 

Proof. The proof is by induction on the number of variables. For P pos the 
result is true. Write 

f(x) =f (x 2 , ...,X d ) +fl(x 2/ ...,X d )Xl H hf n (x 2 , . ..,X d )xj\ 

g[x) = go(x 2 ,...,x d ) + gi(x 2 ,...,x d )xi + h g n -i[x 2 , . . . ^^x^' 1 . 

Since ft < gt for all i, and all gt are in P^ os 1 by induction, it follows that the 
coefficients of gt are either all positive, or all negative. Since g e P d the homo- 
geneous part g H has all positive coefficients, and so each gt has some positive 
coefficient. This shows that g e P d ° s . □ 

The positivity of substitutions implies that a polynomial in P d is in P^ os . 
Note that if f (x) is positive for all positive x then it is not necessarily the case 
that all coefficients are positive: consider 1 + (x — l) 2 . 

Lemma 10.34. Iff (x) e P d and f (a) is positive for all a e (R + ) d then f e P d ° s . 

Proof. We prove this by induction on d. If d = 1 and f (x) has positive leading 
coefficient then we can write f = aJ7(x + r^). Since this is positive for all 
positive x it follows that no n can be negative, and therefore all coefficients 
are positive. 

In general, write f(x,y) e P d +i as Y. ^ii x )v x - Now for any a e (R + ) d the 
hypothesis implies that f (a, y ) is positive for all positive y . Thus all coefficients 
ft (a) are positive, and so by induction all coefficients of f (x, y ) are positive. □ 

The next two results show the relation between homogeneous polynomials 
and the positivity condition. 

Lemma 10.35. Iff is a homogeneous polynomial then 
fep d iff 

f(l,x 2 x d )eP^. 

Proof. The assumption that f is homogeneous is equivalent to the equality 
f = f H . The result now follows from the definition of P^ os . □ 

Lemma 10.36. If f e P d (n) let F be the corresponding homogeneous polynomial. 
We construct F by replacing each monomial x 1 by x I x^{ 1 ' . A necessary and sufficient 
condition that F <E P d+ i is that f e Pj 0S . 
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Proof. All coefficients of F H are positive. If F e Pd+i then F H = F. Since f 
is found by substituting 1 for Xd+i in F, all coefficients of f must be positive. 
Conversely, since the coefficients of F are the same as the coefficients of f we 
see that the assumption on f implies that F satisfies positivity. □ 

We know that the terms of maximal total degree of a polynomial in Pd 
constitute a polynomial that is also in Pd- If f G P^° s then there is a stronger 
result: 

Lemma 10.37. Suppose f e P^° s (n) and h T is the polynomial consisting of all terms 
of total degree r. Then h r £ F p d s {r) for 1 ^ r ^ n. 

Proof. If we homogenize then 

F(x, z) = g n + zgn-i H h z n g 

Since g r is the coefficient of z n ~ r the conclusion follows. □ 

Here is a condition for interlacing in P^ os that only involves positive linear 
combinations. 

Lemma 10.38. If f, g S P^° s and deg(f) > deg(g) then f <. g if and only if these 
two conditions hold 

(1) f + age P^ d os for all a>0 

(2) f + agxt G F^" for all a > and some i 



Proof. Assume that f <Jg. Both left hand sides are have homogeneous part 
equal to f H plus perhaps a contribution for g H , so they both satisfy the homo- 
geneity conditions. We need to verify substitution. If we consider substituting 
for all but the Xi variable, and restrict a to be non-negative, then both f (x a ) and 
g(x a ) are in P pos . The conclusion now follows from Lemma [2.13l 

Conversely, if (1) and (2) are satisfied then we can again use Lemma [2.131 
to see that f<g. □ 

Just as for one variable we can define a polar derivative. If f e P^ os 
then write f = Y. Q i xI - The corresponding homogeneous polynomial F = 
Y_ aix'y 11 - - ' 1 ' is in Pd+i, and if we differentiate with respect to y and then set 
y — 1 we get the polar derivative 

^Jn-HDatX 1 (10.4.1) 

which can also be written in the more familiar fashion 

9"f 91" 

nf-xi- x n — (10.4.2) 

OXi ox n 

We will return to the polar derivative in Section ["l0.15l 

Rather than forming the homogeneous polynomial with respect to all the 
variables, we can homogenize some of them. 
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Lemma 10.39. J/f(x,y) = £ a^xjy 1 e FT +e {n) then 
g(x,y,z) = £ ajMyV 1 - 111 e P d+e +i(n) 

Proof. g H = g, so g satisfies homogeneity condition. Since 

g(x,y,z) = z n f(x,^) 
z 

it follows that g satisfies substitution. □ 

We have a more general form of polar derivative: 
Corollary 10.40. J/f(x,y) = £ ai(x)y' e P^+ e (n) then 

f(x,y)< ^(n-UDaztx)/. 

10.5 Coefficient inequalities for P^ 055 , hives, and the Horn 
problem 

Every polynomial in determines a homogeneous polynomial in P3. If 
f(x,y,z) e P3 is such a homogeneous polynomial then the three polynomi- 
als f(x, 1, 0), f(0,x, 1), f(l, 0,x) have all real roots. We call these the boundary 
polynomials of f . We consider the converse: 

The boundary polynomial problem: 

Given three polynomials fi(x), fz[x), ij,{x) with all real roots, when 
is there a polynomial f(x,y,z) G P3 whose boundary polynomials 
are h,h,h? 

Surprisingly, this is related to a famous problem involving matrices. 

The additive Horn problem: 

Fix an integer n, and let a, |3, y € K n . Is there a triple (A, B, C) of 
Hermitian matrices with A + B + C = 0, and eigenvalues a, (3,y? 
If so, we say Horn's additive problem is solvable for a, B,Y- I161| 

We begin by looking at a particular example, and then establishing in- 
equalities for coefficients of polynomials in F^" 3 . Taking logs of these inequal- 
ities gives a structure called a hive. We then recall the connection between 
hives and Horn's problem to get information about the boundary polynomial 
problem. 

Suppose that we are given a polynomial of degree n in F^ os that is homo- 
geneous: 

f(x,y,z) = (z + x + y)(4z + 2x + y)(2z + 3x + y)(3z + 5x + y) 
= 24z 4 + 112 xz 3 + 186 x 2 z 2 + 128 x 3 z + 30 x 4 + 50 y z 3 + 175 xy z 2 + 190 x 2 y z+ 
61 x 3 y + 35y 2 z 2 + 80xy 2 z + 41 x 2 y 2 + 10y 3 z + 11 xy 3 + y 4 . 



CHAPTER 10. POLYNOMIALS IN SEVERAL VARIABLES 



323 



We can represent the terms of f (x,y) in a triangular diagram. 



y 4 






10y 3 z- 


— llxy 3 




35u 2 z 2 - 


— 80xu 2 z — 


- 41 x 2 y 2 


50 yz 3 - 


— 175 xyz 2 — 


190x 2 yz 


24 z 4 — 


— 112 xz 3 


■^186 x 2 z 2 



61 x 3 u 
128 xV 



30 x 4 



The three boundary polynomials of f occur on the boundary of this dia- 
gram, and are 

f (x, 1, 0) = 30x 4 + 61x 3 + 41x 2 + llx + 1 
f (0, x, 1 ) = x 4 + 10x 3 + 35x 2 + 50x + 24 
f(l,0,x) = 24x 4 + 112x 3 + 186x 2 + 128x + 30 

If we only list the coefficients then we get the triangular diagram 
1 



10 

35; 

so- 



il 

-80- 
175 



-41 ^ 
190 



61 



24 — 112 — 186 — 128 — 30 
This array of numbers has the property that the coefficients in each of the 
three kinds of rhombus satisfy be ^ ad: 

b — d a — b a 



Proposition 10.41. For any homogeneous polynomial in Pj 04 , construct the trian- 
gular array of coefficients. In every rhombus formed by two adjacent triangles the 
product of the two numbers on the common edge is at least the product of the two 
remaining vertices. 

Proof. The inequalities corresponding to the three different kinds of rhombus 
follow by applying the inequalities for polynomials in P2 (Theorem 19. 1131 1 to 
f(x,-y,l),f(x,l,z), andfCl^z). □ 
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Suppose we choose f(x,y) = Y. a u xX V^ in P?° S- Since all coefficients are 
positive, we can define real numbers Kij^ = log(aij) where i + j + k = n. If 
we write the inequalities of Prop osition 1 1 .4 ll in terms of the hij 



Q i+l,j a i,j+l ^ Q i,j a i+l,j+l 
Q i+l,i-l Q i,j-1 ^ Q i,j a i+l,j-2 
Q i-l,i+l a i-l,j ^ Q i,j a i-2,j+l 

then they translate into inequalities about the risk's: 



hi+i,j,k-i + h-i,j+i,k-i ^ hy,ic + hi+i,j+i,k-2 
hi+ij-i,k + htj-i^+i 5^ rM,j,k + hi. + u_2,k+i 

hi-l,j+l,k + hi-l,j,k+l ^ hi,j,]c + hi._2,j+l,k+l 

This is precisely the definition of a /zz'ue 11611 . There is a wonderful theorem 
that relates hives to the Horn problem. The boundary of a hive consists of the 
three vectors (vi,V2,V3) formed by taking differences of consecutive terms on 
the boundary. In terms of the atj 's this is 



Vi = (h n _i4 ; o — h n ,o,0/ h n _2,2,o 

v 2 = (ho /Tt _i4 — ho /R ,0/ ho, n -2,2 
v 3 = (hl,o,n-l — hfl^n./ h2 / 0,n-2 

then 



— hn-1,1,0/ ' ■ ' / ho, n ,0 — hi^n-i^) 

— ha n _i,i, • • • /ho,o,n — W),l,n-l) 

— hi^n-l/ ' ' ' / h n/ o,0 — ^-n-l,0,l) 



The Hive Theorem: Horn's problem is solvable for (vi,V2,V3) if and only if 
(vi,V2, V3) is the boundary of a hive. fi"6"Tl 

We now determine the boundary of the hive corresponding to f . 



/ dn-14 dn-2,2 
Vl = l0g ~7, ' l0g n ' 

= the logs of consecutive coefficients of the boundary polynomial f (x, 1, 0) 

/, O-O.n-1 , d0,n-2 
V 2 = log ,log , 

\ CL(),n CL0,n-l 

= the logs of consecutive coefficients of the boundary polynomial f (0, x, 1) 

( i a l,0 1 a 2,0 

v 3 = log ,log 

V a ,o ai (0 

= the logs of consecutive coefficients of the boundary polynomial f (1, 0,x) 
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Corollary 10.42. Ifi[x,y,z) £ P3 is homogeneous, and v\,V2,vs are the logs of the 
consecutive coefficients of the boundary polynomials, then Horn's problem is solvable 
for v 1 ,v 2 ,v 3 . 

Although polynomials in determine hives, the converse is not true: 
here is an example of a hive that does not come from a polynomial. We don't 
write the hive, but rather the coefficients of polynomial; it's easy to check that 
all of the multiplicative rhombus inequalities are satisfied. If the array were 
determined by a polynomial in P^" 3 then the polynomial corresponding to the 
coefficient of y, 11 + 8x + 3x 2 , would be in P, but it isn't. 



1 




6 — 11 — 6 — 1 



Horn's problem is connected to the conjecture (now solved - see (64)) 
that specifies necessary and sufficient conditions for vectors a, (3,y to satisfy 
Horn's problem. We can use this to get necessary conditions for the boundary 
polynomial problem. Here is one example. We first order cc, |3, y: 

a :ai > 0L2 ^ • • • ^ a n 
P:Pi ^ P 2 > ••• > Pn 
T Ti > 72 > ■ ■ ■ > Tn 

Weyl's inequalities are 

at + |3j + y n +2-i-j ^ whenever i + j — 1 ^ n 

We now apply this to our problem. For our particular polynomial we have 
that 



/ 61 41 11 1 
vi^log -,log -,log -,log- 

/ 10 35 50 24\ 
v 2 =(bg T ,log -,log -,log -j 

/ 112 , 186 , 128 , 30 

V 3=^Og 24, bg ^log m ,** m 

Notice that they are all decreasing - this is true in general, and is a con- 
sequence of Newton's inequalities. Since these quotients are decreasing, they 
are in the correct order to apply the Weyl inequalities. Using the values of 
vi,V2,V3 from above, the following is a necessary condition for the ay to be 
the coefficients of boundary polynomials. 
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log 



, CLo, n -i . , Q n+2-i.-j,0 . n 

log h log > 

a 0,n-i+l Qa+2-i-j+l 
> 1 



dn-i+U-l OLo /T t-i+l Ga+2-i-j+l 



10.6 Log concavity in Pf os 

We say earlier ((p. |118l l) that the graph of the log of the coefficients of a poly- 
nomial in P pos is concave. We now prove a similar result for polynomials in 
P2° s . Given a polynomial Y_ ckj* 1 !)' m P;? 08 , the set of points 

{(i, j,log(aij) ) | ayxV is a term of f } 

is called the graph of the log of the coefficients of f . We view this as the graph 
of a function defined on the points in the plane corresponding to the expo- 
nents appearing in f . 

For example, the diagram below displays the log of the coefficients of a 
polynomial in P^. The coordinates of the point in the lower left corner are 
(0, 0), since 24 is the constant term. The coordinates of the rightmost point are 
(4, 0), and the uppermost point's coordinates are (0,4). 
logl 



log 10 — log 11 



log 35 — log 80 log 41 



log 50 — log 175 — log 190 — log 61 



log 24 — log 1 12 — log 186 — log 128 — log 30 



Proposition 10.43. If f e V p 2 then the graph of the log of the coefficients is concave. 
Every triangle in the planar diagram determines a triangle in the graph, and the graph 
lies on one side of the plane containing that triangle. 

Proof. We will prove the second statement, for that is what we mean by saying 
that the graph is concave. Note that it suffices to show that the plane through 
a triangle lies above all of the three adjacent triangles. We now show that this 
is a consequence of the rhombus inequalities. 
Consider the triangle T with solid lines 

(r,s + l,loga T/S+ i) (r + l,s + l,loga r +i, s +i) 



(r,s,log a r , s 



(r + l,s,log a r+ i /S 
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The plane in R 3 containing T is 

(log Q r+ i, s - log Q r , s )X + (log Q r ,s + 1 - log Q r ,s )V - Z- 

(r log a r+ i, s + s log a r , s +i - (r + s + 1) log a r , s ) = 

The expression on the left side takes negative values for points above the 
plane, and positive values for points below the plane. If we evaluate it at the 
vertex (r + 1, s + 1, a r +i, s +i) of the dotted adjacent triangle we get 

log a r+ i, s + log Q r , s +i - log a T , s - log a T+1 , s+ i = 

a r+ i jS a TjS+ i 
log ^ 

a r,s 1r+l,s + l 

where the last inequality is the rhombus inequality. A similar computation 
shows that the plane lies above the other two adjacent triangles, so the graph 
is concave. 

□ 

Corollary 10.44. Suppose that f = Y. a y xl U ' € • For all non-negative i, ), r, s 
for which a r , s ^ we have 

a r + l,s \ ( 1r,s+l V "> a r+i,s+j 



Proof. Evaluate the plane through the triangle 

{(r,s,loga r , s ), (r + l,s,loga r+ i, s ), (r,s + l,log a T/S+1 )} 

at the point (r + i, s + j, log a r+ j. /S+ j ). The result is positive, and simplifying 
yields the inequality. □ 

Remark 10.45. If qo,o = 1/ then the inequality shows that the coefficients of f 
satisfy a simple bound: 

a lo a ai > a tj- 



10.7 Extending two polynomials 

The boundary polynomial problem asks when can three polynomials be real- 
ized as the boundary polynomials of a polynomial in Pj 06 . We now ask the 
simpler question 

Given two polynomials f (x), g(x) in p^os^ (j escr j]3 e j- ne structure of 
all F(x,y) in P^ 05 such that F(x,0] = f(x) and F(0,y) = g(y). We 
say that F is an extension of f, g. 



There is a trivial requirement that guarantees the existence of an extension. 
This result is similar to Lemmas 19.1031 and 19 . 1 051 
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Lemma 10.46. Assume that f , g e pP° s . There is an extension off, g if and only if 
f(0) = g(0). 

Proof. If there is an extension then F(0, 0) = f(0) = g(0). Conversely, we 
may assume that f(0) = g(0) = 1. . We can write f(x) = f7(l + r^x) and 
g(x) = flfl + six). The product below is the desired extension. 

na+nx + siy) 

□ 

Suppose that F is an extension of f, g. We know that we can write F(x, y ) — 
I + xDi + UD2I where Di and D2 are positive definite. Moreover, since f (x) = 
I + xDi| we know the eigenvalues of Dj: they are the negative reciprocals of 
the roots of f. Similarly, we know the eigenvalues of D2. We may assume that 
Di is diagonal. Write D 2 = OAO* where A is a diagonal matrix consisting of 
the eigenvalues of D 2 , and O is an orthogonal matrix. Consequently, 

There is an onto map from O (n), the group of orthogonal matrices, 
to the extensions of f, g. 

Since the orthogonal matrices in 0(2) are easy to describe we can com- 
pletely determine the extensions for a quadratic. 

Lemma 10.47. Suppose f = (1 +r\ x)(l + +r 2 x) and g = (1 + si x)(l + S2X). All 
extensions off, g are of the form below, where W lies in the interval determined by 
rjsi + r 2 S2 and riS2 + r 2 si. 

sis 2 u 2 




(si + s 2 ) y Wxy 




1 [n + r 2 ) x nr 2 x 2 

Proof. If the orthogonal two by two matrix is a rotation matrix, the general 
extension of f , g is given by the determinant of 

(1 0\ fn 0\ f cos 9 sin9\ /si 0\ /cos 9 -sin9\ 
^0 l) +X \0 r 2 ) +v \-sme cosgJ^O s 2 y ^sin9 cos9j 

If we simplify the coefficient of xy in the determinant we find it to be 

\ [(ri + r 2 )(si + s 2 ) - (ti - r 2 )(si - s 2 ) cos(29)] 

The two values at cos 29 = ±1 yield the bounds of the lemma. We get the 
same result if the orthogonal matrix does not preserve orientation. □ 
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The two ends of the interval for W are the values that arise from the two 
different products 



(1 
(1- 



-T]X + siy)(l 
rjx + s 2 y)(l - 



- r 2 x ■ 
r 2 x - 



-S2V) 

■siy) 



In general, if the polynomials are 0(1 + r i x ) an< ^ 0(1 + Sty ) then there are n! 
different products: 

where a is a permutation of 1, . . . , n. If n is odd then O (n) is connected, so 
the set of all extensions is connected. In the case n = 2 we saw that this set is 
convex, and the boundary consists of the two different products. This may be 
the case in general - see Question 1 1231 

It follows from the above that if the quadratic f has all equal roots then 
there is a unique extension. This is true for any n. 

Lemma 10.48. If g S P pos (n) then g(x) and (x+ a) n have a unique extension. 

Proof. Suppose f(x,y) is an extension, so that f(x, 0) = (x+ a) n and f(0,y) = 
g(y). If we write f (x,y) = Y fiM y 1 then f = (x + <x) n and fo <fi, so there 
is a constant ai such that fi(x) = ai(x+ ot) n . Continuing, we find constants 
at so that ft(x) = ch(x + a) n ~\ Since f(0,y) = Y a i a n_1 y 1 we see that g(y) 
uniquely determines the at. □ 



Iff 



P2° s is an extension of two polynomials, then 



&oo dio 



Y atjxV G 

only has one term that depends on the extension. We assume that f = |I + 
x D + y C| where D a diagonal matrix with entries {di}, and C = (cy ) = GEO* 
where E is diagonal with entries {et}. 

Since every entry of y C has a factor of y , the only way we can get a term xy 
in the expansion of the determinant is to take x and y terms from the diagonal. 
Thus, the coefficient an of xy is 



Now aoo = 1/ <Ho = Y. di an d aoi = Y. z n> so we conclude that 



If we write 



Oil 



aio 
aoo 

then Cii 



an 
aio 



Y_ diCu 
Y e i °ij an d consequently 



aio a n 
aoo a io 



4 



die,- of j 



Now the maximum of Y diGjofj over all orthogonal matrices is Y d(e{ 
where {d(} is {di} written in increasing order, and {e(} is {ei} written in increas- 
ing order (Lemma llO. 491 . The minimum is Y &\ e i where {e"} is {ej written 
in decreasing order We thus can make a more precise inequality: 
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Y_«> 

When n = 2 we see that 



olio an 
loo a 10 



qio an 
aoo aio 



= cos 2 (9) (die! + d 2 e 2 ) + sin 2 (9) (die 2 + d 2 ei) 



which shows that the determinant takes every value in the interval deter- 
mined by the largest and smallest products. 

We now prove the Lemma used above; it is a generalization of the well- 
known rearrangement lemma. 

Lemma 10.49. If di,...,a n and bi,...,b n are increasing sequences of positive 
numbers then 



max 

o 



Y °?j a i b ) = Y Q t b t 



y=i t 
roftere f/ze maximum is over all nbyn orthogonal matrices = (oy). 

Proof. If A = diag(ai, . . . , a n ) and B = diag(bi, . . .,b u ) then 
^2 o 2 j a t bj = trace (OM-OB) 

Since orthogonal matrices are normal, we can apply Bourin's matrix Cheby- 
shev inequality |2T1 page 3] to conclude that 

trace (O^OB) < trace (O t OAB) = trace (AB) = Y_ a t b t- 



□ 



10.8 Symmetric Polynomials 

In this section we begin an investigation of the symmetric polynomials in 
Pd- We say that f (x) is symmetric if f (x) = f(crx) for all permutations cr of 
xi,...,Xd. For instance, the elementary symmetric functions are in Pd, and 
the product of any two symmetric functions in Pd is again a symmetric poly- 
nomial in Pd- 

In two variables a homogeneous polynomial f(x,u) is symmetric if and 
only if it has the form 

a y n + a 1 xu rt_1 + a 2 x 2 y n ~ 2 H h a 2 x n ~ 2 y 2 + aix n_1 y + a x n 

The roots of f consist of paired roots {r, 1/r}, along with some ±l's. In degree 
two such a polynomial is 

I 

y 2 + (r + -)xy + x 2 
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It's immediate that the symmetric homogeneous polynomials in P2(2) are of 
the form x 2 + ctxy + y 2 where a ^ 2. We can also characterize the symmetric 
homogeneous polynomials of degree two in three variables. 

Lemma 10.50. The symmetric homogeneous polynomials in P${2) are of the form 

x 2 + y 2 + z 2 + ot(xy + xz + yz) where a ^ 2 
Proof. Setting z = implies x 2 + ax + 1 e P, and thus a ^ 2. Conversely if 
D = (r \ Q = 1 fVr 2 + r + 1 -y/i 



v 1/r; r + lV Vr Vr 2 + r + 1 

then the determinant f of xl + yD + zODO* equals 

x 2 + y 2 + z 2 + (r + l/r)(xy + xz + yz) 

Since D is positive definite and O is orthogonal it follows that f e P 3 . Finally 
note that r + l/r takes on all values > 2. □ 

We can show that the symmetric homogeneous polynomials in Pd(2) fol- 
low the same pattern, but they don't always have a determinant representa- 
tion. 

Lemma 10.51. The symmetric homogeneous polynomials in Pa (2) are of the form 

d 

y~ x? + a Xiy where a > 2 
1 i<j 

Proof. We use the definition of P d- The homogeneous part is certainly positive, 
so we check substitution. Since we are considering a quadratic, we only need 
to check that the discriminant of x d is non-negative. Subsituting at for X| we 
get 

d-l N 2 /d-1 

.2 



/ d-i 

(a-2) (a + 2)^ a 2 +2a^ Q ^ a i 

^ 1 i<j 



The factor a — 2 is non-negative by hypothesis, and the second factor is a 
quadratic form in a\, . . . , Qd-i with matrix Q is 

fa. + 2 a. a . . A 
a a + 2 a 

a a a + 2 ... = 21 + aj 

V ; i i ••/ 

where J is the all one matrix. Now the eigenvalues of J are d — 1, 0, . . . , 0, so 
the eigenvalues of Q are a( d — 1 ) + 2, 2, . . . , 2. Thus Q is positive definite, so 
the discriminant is non-negative, and our polynomial is in P d . □ 
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Lemma 10.52. The polynomial f (x, y, z, w) given below has no representation of the 
form \xl + yDi + zD 2 + wD 3 | where D!,D2, D 3 are positive definite two by two 
matrices and a > 2. 

f (x, y, z, w) — x 2 + y 2 + z 2 + w 2 + a(xy + xz + xw + yz + yw + zw) 

Proof. Since f (x, y, 0, 0) = x 2 + axy + y 2 we see that we must have a ^ 2, and 
that we can write f(x,y, 0,0) = (x + r)(x + 1/r) where r ^ 1. We may assume 
that Di is diagonal, with diagonal r, 1/r, and so a = r + 1/r. 

We proceed to determine D2. f is symmetric for all 24 permutations of the 
variables. Thus |I+xDi| = |I+xD 2 |, and therefore D 2 has the same eigenvalues 
as Di. We write 



D 2 



cos t sin t\ fr \ / cos t — sin t 
-sint cost/ \0 1/r J Isint cost 



Since f (1, x, 0, 0) = f (0, x, 1, 0) we can equate the coefficients of x: 

r + - = 2cos 2 (t) + ^ L — 

r \ L 

Solving this equation yields 

COSt = ±- \/ 1 + 1" + T 2 

1+T 

1 

sint = ±— — ijr 

1+T 

Consequently there are exactly four choices for Di and for D 2 : 



1 / eiVl + r + r 2 62^ 



r + 1 V -e 2 Vt eiVl + r + r 2 



n 1 ^Vl+r + r 2 f 2 Vr" 
u 2 = 



r + lV -f 2 y/t fjVl+r + r 2 

where e\, ezJ\,^2 are ±1. If we equate the coefficients of x in |xDi + D 2 ] 
|xDi + I| we get 

~ r (e 2 f2r+e 1 f 1 (l+r + r 2 )) =-^71+7+^2" 



(1+rp 11 1+T 

It's easy to see that this equation has no solution for r ^ 1. □ 

A symmetric homogeneous polynomial F(x, y, z) in P 3 (3) has coefficient ar- 
ray 1 

a a 

aba 

1 a a 1 
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We first show that for any a ^ 3 there is at least one b such that F(x, y, z) e 
P3. If we take a = a + 1 and b = 3a then a ^ 2. Now 

(x + y + z)(x 2 + y 2 + z 2 + a(xy + xz + yz)) e P 3 

and has coefficient array 

1 

1 + a 1 + a 

1 + a 3a 1 + a 

1 1+a 1+a 1 

Newton's inequality applied to the first and second rows yield a ^ 3 and 
b ^ 2a. However, we can get more precise results. As above, the roots of the 
bottom row are — r, — 1/r, —1, so a = 1 + r + 1/r. We assume r > 1. Since the 
second row interlaces the first, we have that 

-b - Vb 2 -4a 2 -b + Vb 2 -4a 2 
-t< 2a * ~ 1/r 

These inequalities yield 

a(r + 1/r) ^ b ^ 2a 

Note that if r = 1 then a = 3 and b = 6. This is (x + y + z) 3 , which is the 
unique extension of the bottom row and the left hand column. These are not 
the best bounds; empirically we find that the intervals for b appear to be 



r=2 
r=3 



b e 
b e 



29 16 
I 4 ' 2 > 



r=5 be ( 



90 

1 9 
326 

~75 



38, 
142, 



When is Y. a i ff i e MA d , where at is the i'th elementary symmetric poly- 
nomial in xi, . . .,Xd? If we set all Xi equal to x, then Y. a i(i) xl e P- We are 
not able to prove the converse; as usual we can't get the final factorial. 

Lemma 10.53. IfY. itX 1 € P then 

1. Y. Q itfi e M4 d . 

2. 21 aiilffi e M/A d . 

Proof. The first sum is the coefficient of y n in 

X a ^ i (n^ +xi )) =L a ^ i -L« n_i<ri 
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while the second equals 

'2> D i)5> wl * 



y=o 

□ 

Example 10.54. We give an example that supports the possibility that the con- 
verse is true. Consider the question 

For which m is xyz — x — y — z + m = 03 — 01 + m. do £ MA3? 

We find 



x 3 - x + m S P =^> |m| < V4/27 

x 3 /6 - x + meP =^> |m| < ^/%J9 



V \0 

We know from Exampl dll.611 that the necessary and sufficient condition is 
that |m| < 2. 

10.9 Linear transformations on P d 

We have looked at linear transformations that map Pd to itself. In this section 
we look at mappings that either delete variables or introduce new variables. 

The repeated application Theorem 110.211 with the linear transformation 
x 1—* ax + b yields 

Corollary 10.55. Choose a\, bi, . . . , ad, bd G K where all of the en's are positive. 
The linear transformation below maps Pd to itself. 

f(xi,...,x d ) i-» f(aixi +bi,... ; adX d +bd). 
We can allow some of the at to be zero. 

Lemma 10.56. If f (x) e Pd and exactly r terms of a\,..., a d are non-zero, and they 
are positive, then f (aixi, . . . , adXd) € P r 

We can also add new variables. 

Lemma 10.57. Iff e Pd(ff-) then g — f (xi + yi,x 2 + y2, . . . ,x d +y d ) is in P2d(i"0- 

Proof. Certainly g satisfies substitution. Every term x^J 1 • • • x^ d in f gives rise to 
(xi + yi) Xl • ■ ■ (xd + yd) 1 " 1 in g. This shows that 

g H (x) =f H (xi+yi,...,x d + y d ) 
Since f satisfies the homogeneity conditions, so does g. □ 
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Lemma 10.58. Iff (x) £ P d then the polynomial g = f(x,x,%3, . . . ,x d ) *s m P d -i- 
AZso, f(x,x, . . .,x) € P. 

Proof. If d is two then this is Lemma 19.321 If d is larger than two then g sat- 
isfies x d -substitution. Since f(x, x, X3, . . . ) H = f H (x, x,X3, . . . ) it follows that g 
satisfies the homogeneity condition. The second assertion follows by induc- 
tion. □ 

We can replace one variable by more complicated expressions. 

Lemma 10.59. If f (x) £ Pd, Pel, and a\, . . . , a d -i are non-negative then 

g(xi,. . .,x d -i) = f(xi,x 2 ,. . .,x d _i,aixi H V a d -ix d -i + (3) £ P d -i 

Proof. By induction on d. The case d = 2 is Lemma 110.581 Since g satisfies 
the positivity condition it remains to show that g satisfies substitution. If we 
substitute Xi = at for all xt except xi and Xd then f (xi, (X2, . . . , <x d -i, x d ) £ P2. 
Consequently f (x\, 1x2, ... , <x d -i, aiXi +b) is in P for any choice of b. Choosing 
b = a 2 a 2 + • • ■ + a d _ia d _i + |3 and observing that 

g(xi, a 2; . . .,a d _i) = f (xi, <Xz, . . . , a d -i, aix a + a 2 a 2 H V a d _ia d _i + |3) 

it follows that g satisfies substitution. □ 

We now use these results to show how very general linear combinations 
preserve P. 

Theorem 10.60. Suppose f(x) £ P d . If Mis a d by d matrix with positive entries 
t/ienf(Mx)eP d . 

Proof. Let M = (tOij ) and define a polynomial in d 2 variables x-y 



Repeated applications of Lemma 110.561 and Lemma 110.571 show that g is in 
P d 2. We now use Lemma [10.581 to identify all pairs of variables Xij and xtk- 
The resulting polynomial is exactly f ( Mx) . □ 

Lemma 10.61. If f, g £ P d and f = Y.i a i xI then for any index K 

,(/) 

Proof. Using the Taylor series for g we get 



ai 7T 

I+J=K ' 



f(y)g(x + y) = V a I x I ^^ T g(x) 
The coefficient of y K is the sum in the conclusion. □ 
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10.10 The graph of polynomials in P d 

In P 2 we used the geometry of the graph of f(x,u) e P 2 to show that 
f(x,x) E P. Now we use the fact that if f(x) e P d then f(x,x, ...,x) e P to 
get information about the geometry of the graph of f . 

For clarity we'll take d = 3. Choose f(x) e PdW- The graph Gf of f is 
{{x,y,z) I f(x,y,z) = 0}. Choose any point v = (a,b,c) on the plane xj + 
x 2 + x 3 = 0. Since f(x — v) = f(x — a,y — b,z — c) is in P d , the equation 
f(x — a,x — b,x — c) =0 has n solutions. Denote the i-th largest solution by 
Si(v), and letWi(v) = (si(v) — a, Si(v) — b, St(v) — c). By construction we know 
that f(wi(v)) = 0, so Wi(v) is a point on the graph of f. Consider the map 
v 1 ► Wt(v). This is a function from the plane xi + x 2 + X3 = to the graph 
of f that is 1 — 1, and continuous since solutions to equations are continuous 
functions of coefficients. Moreover, since the union of all lines of the form 
{-v + t(l, 1, 1)} is a partition of R d , every point on the graph of f is of the form 
Wi ("v) for a unique y, and perhaps several i's. Thus, the graph Gf is the union 
of n subsets Ga of K d , each homeomorphic to K d_1 . These subsets might 
intersect. 

In addition, since all coefficients of f H are positive, there is an a such if 
|x| > a and all coefficients of x are positive then f (x) > 0. Similarly, we may 
assume that if all coefficients are negative then f (x) ^ 0. 

Let K + be the set of all points with all positive coefficients, R~ the set with 
all negative coefficients, and S be the ball of radius ex. The previous paragraph 
shows that the graph Gf separates M + \ S from R~ \ S. 

We have seen that if a, b, c are positive and f e P3(n), then f (ax + a, bx + 
|3, cx + y) is in P(n). This is a special case of the following observation: 

If a curve goes to infinity in all coordinates in R + and also in R~ 
then it meets the graph of f in at least n points. 

Here's a variation: suppose that the line £ is parallel to a coordinate axes. 
For instance, £ could be {(a, b, c) + t(l, 0,0) 1 1 e K}. Then, £ meets the graph 
in n points. To see this, notice that there is a sufficiently large T such that 
the line segment from (a + T, b, c) to (a + T, 0, 0) does not meet Gf. Neither 
does the ray {(a + T + t, 0, 0) | t ^ 0}. Similarly analyzing negative values 
we can find a corresponding S and conclude that the piecewise linear curve 
consisting of the segments 

{(a + T + t,0,0) |t>0} 
(a + T,b,c) - (a + T,0,0) 
(a + T,b,c) - (a — S,b,c) 
(a-S,b,c) - (a-S,0,0) 

{(a-S-t,0,0) |t>0} 
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meets Gf in n points, and consequently the line {(a + t,b,c) [ t € R} meets Gf 
in n points. 

If f G P d ° s then x € M + implies that f (x) ^ 0. Any curve from a point in R + 
that goes to infinity in R~ must intersect every Si. 



10.11 Differential operators 

In this section we introduce differential operators on P d of the form 

3 9 3 

ox oxi 3x d 

where f is a polynomial in d variables. To simplify notation, we will some- 
times write 3 X for In Lemma |13.22| we will show that f(3 x ) maps Pd to 
itself. This requires the consideration of exponential functions. If we restrict 
ourselves to polynomials then 

Lemma 10.62. Suppose that f is a product of linear terms 

f = Y\^° i + aiiXl Q di*d) = Yl^ i + 31 ' X ) 

i i 

where all the coefficients are positive and at = (an, • ■ • , a-dO- The map f (-J^) is a 
linear transformation from P d to itself. 

Proof. It suffices to assume that f has a single factor. If g is in P d , then any 
derivative of g has total degree less than g. If the constant term of f is non- 
zero, then (f (-g^)g) H is a constant multiple of g H . If the constant term is zero 
then (f (■^■)g) H is a positive linear combination of the partial derivatives of g H 
which implies that all of its coefficients are positive. In either case (f(^-)g) H 
satisfies the positivity condition. 

Substitution is also satisfied since since all the partial derivatives interlace 
g, and and are combined with the correct signs. 

□ 

The next two lemmas are generalizations of Corollary 19.931 and 
Lemma T9.95I the proof are the same. 

Lemma 10.63. If g € P pos , f (x) e P d then g(-3xi3x 2 ) f(x) e P d . 
Lemma 10.64. If g e P pos , f (x) e P d then g[x r g|j) f(x) e P d . 

Lemma 10.65. Let c 1 — {y\, . . . ,y d ) be vectors of all positive terms, and b l — 
( |3J, . . . , |3 d ) vectors of all non-negative terms. The operator 



maps P d to itself. 
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Proof. It suffices to verify it for one factor. Choose g € Pd and set 

h= ( C ' X - b 'l) 9 - 

Since h H = (c • x) g H we see that h satisfies the homogeneity condition. Set k = 
(b • ■$-} g. Since derivatives interlace and all coefficients of b are non-negative, 
we find that g < k. To verify substitution we must show that c- (x?)g(x?) — k(x?) 
is in P for any a € M d . This follows from Lemma [1.20| 

□ 

Corollary 10.66. Let c 1 = (y\, ...,y\) be vectors of all positive terms, and b 1 = 
(|3J, . . . , $\) vectors of all non-negative terms. The map 

g 

fxgwf(c-x-b-r-) g(x) 
ox 

defines a map P x Pd — ► Pd- 

These last results were generalizations to d variables of the corresponding 
one variable results. The next result is strictly a property of more than one 
variable. 

Theorem 10.67. If f (x,y) = Y. a i( x )y 7 nas ^ ne property that f(^, -§-) maps Pd+e 
to itself then all coefficients aj(x) determine operators a-i(^) that map Pd to itself. 

Proof. Choose g G Pd/ a large integer N, and let y N = ■ ■ ■ . The action of 
f(£,j|)on g (x|y~is 

-£«.(!'•« (^ N ) 

Since N is large, all the y terms ^ry N are non-zero and distinct. Since the left 
hand side is in Pd+e/ all its coefficients are in Pd, and hence ai(3 x )g(x) is in 
Pd- Thus ai(dx) maps Pd to itself. □ 

If we choose f to be a product of linear terms, then the coefficients are 
generally not products. This shows that there are polynomials h that are not 
products such that h( 4- ) maps Pd to itself. 

We now generalize Lemma 19.971 and find a Hadamard product. 

Lemma 10.68. IfY. fiMu 1 G P d+ i and £ a i x i G P pos then 



aifitxjy 1 G Pd+i 



(10.11.1) 
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Proof. From Lemma H0,64l we know that 

£ai(ua*) l £f5(x)z* eP d+2 

and the coefficient of z° yields dlO.ll.lt . □ 

In the following lemma we would like to replace exp x f rev with f, where 
we define exp x = exp Xj • • • exp x ( . 

Lemma 10.69. Suppose that f , g € Pd(n.) an d write f = ^m< n <ijx f . Ifi rev — 
Y_ ajx n_f then 

(ex Px n^lgePd 

Proof. The coefficient of y] 1 • • ■ in the product of f (y) and the Taylor series 
of g(x + y) 

(Z^y 1 ) (Zfl w M£ 

equals 



and the latter expression is exactly exp xj ■ • • exp x ( f rev (-^) g- d 

Next we generalize Corollary l7.19l and (|7.4.2) >. 
Lemma 10.70. If g G P has no roots in [0,n] a«d f G Pd(n-) 

g(xi— )f eP d 

OXl 

Proof. It suffices to take g = — a + xi where a ^ [0,n]. We need to show that 
h = — af + xi^- e Pd- We first observe that Corollary 11.151 implies that h 
satisfies xi-substitution. It remains to check the signs of the coefficients of h H . 
If I = (ii,. . .,i d ) and f = Y_ Q i xI then the coefficient of x 1 in h is (ii — aja^ 
Since ^ ij ^ n it follows that if a < the signs of all the coefficients of 
h H are positive, and if a > n the signs are all negative. Thus, in either case 
h G ±P d - □ 

Corollary 10.71. Suppose that f G Pd(n-), and gi, . . ., gd have the property that 
none of them have a root in [0,u]. If f = Y. a i x ' ^ en 

Y_ gi(M)---9d(id)a f x f G P d (10.11.2) 

I=(ii,...,i d ) 

In particular, iff G Pd and gi, . . . gd G P pos then (|10.11.2b holds. 
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Proof. Use Lemma [10.701 equation J7.4.2I I, and the calculation 

3 9 

g(*i g^-) = Y- 9 ^ Xl aiX ' 



□ 



For example, if we take f = (x+y+l) n , gi(x) = (x+l) T and g2(y) = (y+1) 
then 

Y_ (i + D r (j + i) s f. n .)xV e p 2 



Here is a partial converse to Lemma [l0.62l 



Proposition 10.72. If f(x) zs a polynomial with the property that maps Subd 



to itself then f is in Subd- 

Proof. Let f = c^x 1 have xt degree ki. Consider the action of f(^) on the 
polynomial g = x^ 1+N ■ ■ • x^ d+N where N is large: 



Y_ ai( jci + N ) tl • • ■ (kd + N). d X^ 1+N - il • • - x 



k d +N-i d 
d 



where I = (ij, . . .,id). If f(^)9 is m Subd then so is its rever- 
sal(Proposition [l0.5|l 

Y_ ai( k 1 + N ). i • • • ( kd + N ). d x}' • • -x v d d . 

If we replace Xi by Xt/N we get that 



( Tci + N ) ( k d + N ) t t 



N 1 ! N 1 * 

is also in Subd- Since this converges to f we conclude that f € Subd- □ 



10.12 Higher order Newton inequalities 

We show how the Newton inequalities | |4.3.1| l can be interpreted as statements 
about interlacing. This perspective leads to the Newton interlacing proposi- 
tion. 

If g(x) = b + bjx + b 2 x 2 H e P pos then Newton's inequalities state that 

^ nr'" bit—ibv+i. (If we know the degree of g we can be more precise.) We 
rewrite this as the sequence of inequalities 

1 ■ bi 2 ■ b 2 3 • b 3 

> > > ■ 

b bi b 2 
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Each fraction is the negative of the root of a linear polynomial, so we can 
reinterpret these inequalities as statements about the interlacing of a sequence 
of polynomials. 

b + 1 ■ bi x<b x + 2 • b 2 x<b 2 + 3 • b 3 x< • • • 

These polynomials correspond to the double lines in the diagram below, 
where we write the coefficients of g ' and g : 

1 • b, 2 • b 2 3 • b ! 



bo bi b 2 

Now in this diagram g and g ' can be considered to be the first two rows of 
a polynomial in P 2 , as in Figure [TITll If we simply take quadratic polynomi- 
als instead of linear polynomials, then we find that the resulting polynomials 
don't always have all real roots. The solution is to introduce binomial coeffi- 
cients. 

Proposition 10.73. Suppose that f e F^ s [n). Choose a positive integer m. If 
f = Y_ a i,j x1 "y' then define the polynomials 3^ for O^k^n — mby 

j=o 

All the 9^ have all real roots and interlace: 

Consider an example where m = 2 and n = 4. There are three polynomi- 
als, which are identified by the double lines in Figure [TUTtl 

Jo = aoo + 2aoix + a 02 x 2 

3i = aio + 2a n x + ai 2 x 2 

9 2 = a 20 + 2a 2 ix + a 22 x 2 

The proposition asserts that % < 3i < ?2- 
Proof, (of the proposition) If we write 

f(x,u) = fo(u)+f 1 (y)x + f 2 (y)x 2 + --- 

then 

J k = (l + y p*f k (u) 

By Lemma [l0.68l we know that (l+y) m *f(x,u) e P 2 . The conclusion of the 
proposition follows since consecutive coefficients of polynomials in P 2 inter- 
lace. 

□ 
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10.13 Negative subdefinite matrices 

When does a quadratic form xQx t have the property that xQx 1 — c 2 G Pd for 
all c? The answer is simple: Q is negative subdefinite. 

Definition 10.74. A real symmetric matrix A is negative subdefinite if it has 
exactly one positive eigenvalue, and all entries are positive. 

Our goal is the following theorem, which will follow from Lemmas 110.861 
and ll0.87l in the next section. 

Theorem 10.75. Suppose Q is a real symmetric matrix. The quadratic form xQx 1 — 
c 2 is in Fa for all c if and only ifQ is negative subdefinite. 

In this section we establish various properties of negative subdefinite ma- 
trices. We should first observe that among all real symmetric matrices with 
all positive entries, the negative subdefinite matrices are those with the small- 
est possible number of positive eigenvalues. They can't all be negative, since 
Perron's theorem guarantees at least one positive eigenvalue. 

We first note that subdefinite matrices are hereditary in the following 
sense: 

Lemma 10.76. If A is a negative subdefinite matrix then all the principle submatrices 
of A are also negative subdefinite. 

Proof. It suffices to show that the matrix T formed by removing the last row 
and column is negative subdefinite. The eigenvalues of T interlace the eigen- 
values of A by Theorem 11.611 Thus, since A has exactly n — 1 non-positive 
eigenvalues, T has at least n — 2 non-positive eigenvalues. Now the sum of 
the eigenvalues of T is the trace of T which is positive, so T must have at least 
one positive eigenvalue. Thus T has exactly one positive eigenvalue. □ 

ao4 



103 113 



Q02 0L12 0-22 



Ill 121 1 



100 110 "?.;: 1-30 Q 40 

Figure 10.1: A polynomial in P2(4) 
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Here is an alternative characterization of negative subdefinite matrices. 
Recall that a symmetric matrix is positive definite if and only if the determi- 
nants of the principle submatrices are all positive. The k-th leading principle 
submatrix is the submatrix formed from the elements in the first k rows and 
columns. 

Lemma 10.77. Suppose that the nby n matrix A has all positive entries, and all 
principle submatrices are invertible. Then, A is negative subdefinite if and only if the 
determinant oftheY-th leading principle submatrix has sign (— l) k+1 ,/orl ^ k ^ n. 

Proof. If A is negative subdefinite, then the k-th principle submatrix is also 
negative subdefinite, and so it has k— 1 negative eigenvalues, and one positive 
eigenvalue. Since the determinant is the product of the eigenvalues, the result 
follows. 

Conversely, we prove by induction that the k-th leading principle subma- 
trix is negative subdefinite. It is trivial for k = 1, so assume that the k-th prin- 
ciple submatrix is negative subdefinite. Since the eigenvalues of the (k + l)-st 
leading principle submatrix P are interlaced by the eigenvalues of the k-th, it 
follows that P has at least k — 1 negative eigenvalues. Now, by hypothesis the 
determinant of P has sign ( — 1 ] k , and so P must have exactly k negative eigen- 
values, and therefore one positive one. Thus, P is negative subdefinite. □ 

Note that this argument also shows that if Q is negative subdefinite and | Q | 
is non-zero, then all leading principle submatrices have non-zero determinant. 
Here is a useful criterion for extending negative subdefinite matrices. 

Lemma 10.78. Suppose that Q = ( ^ ^ ) where all entries are positive, u is 1 by n, 
c is a scalar, |Q| ^ 0, and |A| ^ 0. Then, Q is negative subdefinite if and only if these 
two conditions are met: 

1. A is negative subdefinite. 

2. uA-W - c> 

Proof. We use the Schur complement formula [85, page 22]: 

|Q| = |A||c-uA-V| 

By hypothesis |c — uA^u 1 ! is negative, so Q| and |A| have opposite signs, and 
the conclusion now follows from Lemmas 1 1 0. 771 and 1 1 . 761 □ 

Example 10.79. It is not difficult to find negative subdefinite matrices. Con- 
sider the matrix 2 aj n + bl n where J n is the n by n matrix of all l's, and I n is 
the identity matrix. For example, J4 — I4 is 

/0 1 1 1\ 

10 11 

110 1 
\1 1 1 0/ 



2 In the statistics literature this is known, after scaling, as the equicorrelation matrix. 
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The eigenvalues of aj n + bl n are na + b and n — 1 b's. If we choose b < 0, 
a > and a + b > then aj n + bl n has exactly one positive eigenvalue and 
all positive entries. The matrices J n — I n are limits of negative subdefinite 
matrices. 

We can use Lemma 110.781 to construct negative subdefinite matrices. For 
example, the following matrix 



/I 

2 

3 
4 

V5 



5\ 
5 
5 
5 

5/ 



is negative subdefinite since all entries are positive, and the determinants of 
the leading principle submatrices are 1, —2, 3, —4, 5. 



We can embed negative subdefinite matrices in higher dimensional matri- 



ces. 



Lemma 10.80. 

ofd+lby d4 

Proof. Write Q 



If Q is a dby d negative subdefinite matrix, then 
1 negative subdefinite matrices. 



Q °) is the limit 



where Aisd — lbyd— 1. Consider the matrix 



M = 



A 
v* 
, av 1 



V 

c 

ac 



If we subtract a times the dth row from the bottom row we see that 



A 
v* 
av 1 



V 

c 

ac 



av 
ac 



A v av 
v t c ac 
e-a 2 c 



(e-a 2 c)|Q| 



We choose positive a, e so that e — a c < 0. Now M has all positive entries, 
and its eigenvalues are interlaced by the eigenvalues of Q. Thus, M has at 
least one positive eigenvalue, and at least d — 1 negative eigenvalues. Since 
the determinant of Q is the opposite sign from the determinant of M, it fol- 
lows that M and Q have a different number of negative eigenvalues, so M 
had d negative eigenvalues, one positive one, and so is a negative subdefinite 
matrix. Taking the limit as e and a go to zero gives the conclusion. □ 

What do cones of negative subdefinite matrices look like? Equivalently, 
when are all positive linear combinations of a collection of negative subdefi- 
nite matrices also negative subdefinite? Here is an answer for two matrices. 

Lemma 10.81. Suppose that 

1. Qi is negative subdefinite and invertible. 
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2. Q 2 has all non-negative entries. 

3. Q 1 _1 Q2 has no negative eigenvalues, 
then 

1. aQi + bQ 2 is negative subdefinite for all positive a, b. 

2. Q 2 is the limit of negative subdefinite matrices. 

Proof. We claim that if a, b are positive then aQi + bQ 2 is never singular. If it 
were then there is a -v such that 

aQi-v + bQ 2 -v = 

which implies that 

But Qf x Q 2 has no negative eigenvalue, so this isn't possible. Thus, as a, b 
vary over positive reals the eigenvalues never change sign. Since a = 1 and 
b = yields Qi which has exactly one positive eigenvalue and no zero eigen- 
values, all positive linear combinations aQi + bQ 2 have exactly one positive 
eigenvalue, and are invertible. Since all entries of Qi are positive, and of Q 2 
are non-negative, the entries of aQi + bQ 2 are all positive. This establishes (1). 
The second part follows by taking a — > and b = 1. □ 

Corollary 10.82. Suppose Qi, Q 2 , . . . , Q n are negative subdefinite invertible matri- 
ces. 

1. aQi+bQ 2 is negative subdefinite for all positive a, b iff Q^ l Q_2 has no negative 
eigenvalues. 

2. If Qi, . . . , Q n lie in a cone of negative subdefinite matrices then Q^Qj has no 
negative eigenvalues for 1 ^ i,j < n. 

Remark 10.83. If Q is an invertible negative subdefinite matrix, then the eigen- 
values of Q n are the nth power of the eigenvalues of Q. Thus, if n is an integer 
then Q 2n+1 is also negative subdefinite. Moreover, if m is an integer then the 
eigenvalues of (Q 2m+1 )- 1 (Q 2n+1 ) = Q 2n ~ 2m are all positive. Thus, 

Q Q2n+l + bQ 2m+l 

is negative subdefinite. 

Here is an example of a cone in dimension two. Define 

- (; I) 

As long as < a < b, Q Qib is negative subdefinite. Since < a' < b' 
implies < a + ta' < b + tb' for any positive t it follows that any positive 
linear combination of these matrices is still negative subdefinite. 
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The following property will be used in the next section. Recall that nega- 
tive semz'definite means that there are no positive eigenvalues. 

Theorem 10.84 ( 11331 1. Suppose that A is a real symmetric matrix that is not neg- 
ative semidefinite. The following are equivalent: 

1. A is negative subdefinite. 

2. For every vector x, xAx 1 > implies that Ax is a non-positive or non-negative 
vector. 

The following lemma is used in the next section. 

Lemma 10.85. If Q = ( v a t q ) is negative subdefinite, then v*v — aC is positive 
semidefinite. 

Proof. Assume that v l v — aC is not positive semidefinite. Then, there is a z 
such that z(v*v — aC)z l < 0. This implies that zCz* > ^zv t vz t . Evaluate the 
quadratic form at w = (a, z) where a is to be determined. 

J = ct 2 a + ol(zv 1 + vz t ) + zCz 1 

1 

> a 2 a + a(zv t +vz t ) H — zvH^z 1 
a 

1 ♦ 9 

= -(aa + zvT 
a 

If aa + zv t 7^ then wQw t > 0, so by Theorem 110.841 Qw l has either all 
non-negative, or all non-positive, coefficients. But 

Qw t = (aa + vz\ av f + Cz*) . 

Since aa + vz t can be either positive or negative, as aa + vz L becomes zero, so 
must (xv t + Cz 1 . Thus, when a = — -i-vz 1 we have 

(--vz^v* + Cz t = --{vhf - aC)z t = 
a a 

This contradicts our hypothesis that z(v t v — aC)z l < 0, and so the lemma is 
proved. □ 

10.14 Quadratic Forms 

We have seen substitution of some quadratic forms before. For instance, if 
f e P pos then f(-x 2 ) G P. Also, if f G P pos , then f(-xy) G P 2 . We will 
replace — x 2 and — xy by quadratic forms determined by negative subdefinite 
matrices. We begin with substitution into the simplest polynomial: x + c 2 : 




When is -xQx 1 + c 2 in ±P 2 ? 
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Let's first consider the case of a quadratic form in two variables. Take a 
matrix A = ( a? al ) with corresponding quadratic form aix 2 + 2a2xu + a^y 2 . 
We are interested when the polynomial 

( X ^)(;)-c 2 = a 1 x 2 + 2a 2 xu + a 3 u 2 -c 2 

is in P2 for all c. We must have that all cu are positive, and the discriminant 
should be non-negative. Thus 

4a! - 4aia 3 + 4c 2 = - | £ £ | + 4c 2 > 

and hence |A| ^ 0. A has a positive eigenvalue, since the sum of the eigen- 
values is the trace which is positive. Consequently, xAx 1 — c 2 G P2 iff A is 
negative subdefinite. 

Lemma 10.86. If Q is a A by A symmetric matrix such that xQx 1 — c 2 e ±Y&for 
all c then 

1. Q has all positive entries. 

2. Q has exactly one positive eigenvalue. 

3. Q is negative subdefinite. 

Proof. Since xQx 1 — c 2 € ±Pd the homogeneous part is in Pj, and so the entries 
of xQx* are all the same sign, and are non-zero. If they are negative then 
letting Q = (ay) and x = (x, 0, . . .,0) yields xQx 1 — c 2 = anx 2 — c 2 . This has 
all real roots iff an is positive, so all entries of Q are positive. 

For the second part we only need to assume that xQx 1 e Pd- Recall Per- 
ron's theorem |85] that says that since all entries of Q are positive there is 
a unique largest positive eigenvalue A, and the corresponding eigenvector 
v = (vi, . . . , Vd) has all positive entries. Suppose that [i is an eigenvalue with 
eigenvector u = (uj, . . .,Ud). Since Q is symmetric and A 7^ \i it follows that 
u*Qv = 0. If we set f (x) = xQx* then we can replace each Xi by v^x + ut and 
the resulting polynomial is in P since all Vt are positive. Consequently, 

f(xv + u) = (xv + ujQtxv + u)* =Ax 2 |v| 2 + |j.|u| 2 e P 

The only way that this can happen is if \x is negative or 0, so there is exactly 
one positive eigenvalue. □ 

Lemma 10.87. IfQ is negative subdefinite then xQx 1 — c 2 € Pdfor all c. 

Proof. The homogeneous part of xQx* — c 2 is a polynomial with all positive 
coefficients since Q has all positive coefficients. Thus, we only have to check 
substitution. 
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Write Q = ( v a t c ) where v is 1 x (d - 1) and C is (d - 1) x (d - 1), and let 
x = (xi, z) where z = (X2, ■ ■ ■ , x d ). With this notation 

„ t , \ ( Q /x 
xQx 1 = (X!,Z) t 



c 



= ax\ + xi (vz* + zv* ) + zCz* 

xQx* satisfies xi substitution if and only if the discriminant is non- 
negative. Now vz* = zv t since they are both scalars, so the discriminant 
condition is that 

sc 4(vz t ) 2 - 4azCz t = 4z(v t v - aC)z l 

v*v — aC is a symmetric matrix, so xQx* satisfies xi -substitution iff v*v — aC 
is positive semidefinite. Now apply Lemma [10. 851 □ 

10.15 The analog of xD 

The differential operator x • 5 = xi + • • • + XdgT^ * s * ne ana l°g m P d ° S °^ me 
operator x-j^ on P pos . Here are some elementary properties of this operator. 

1. If f(x) is homogeneous of degree n then (xjg^- + • • • + x d g|— )f = nf. 
This is the well-known Euler identity about homogeneous functions. 

2. If f e P d (n) then Hx!^- + • ■ • + x d gf 7 )fj = nf H . This is an immedi- 
ate consequence of the previous property. 

3. If g(x) —Y. iix 1 then 

f(xi^- + • • • + x d -^-)g = V a.fdIDx 1 
OXl ox d * — 

This is a simple consequence of linearity and the calculation 



o o 

(x-aix 1 ^ (xi— + --- + x d — Mxj 1 ■■■x'x) = (ii + --- + i d )xi 1 

oxi 3x d u 



x d 



The following is the basic fact about x • 9. 

9x7 ^ ' * M1 Ov, 



Proposition 10.88. Iff £ V° s [n) then (%i + ■ ■ ■ +x d gf-]f € P^° s . Moreover, 



if g is the polar derivative off then 



3 9 

nf = h hX d~ )f + 9- 

oxi ox d 



We also have interlacings 
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Proof. These are immediate consequences of the properties of the polar deriva- 
tive; see H10.4.H and (10.4.2) . 

In the case that d is two we can derive them in a different way. Notice that 

g 

— f(-xz,-yz] = -(xf x (-xz,-yz) + yfy(-xz,-yz)) 
oz 

Since the derivative is in P^ 013 , if we substitute z = — 1 we get a polynomial in 
P2° S , which shows that xf x + yf y G P2° S . Since xf x + yf y satisfies the homo- 
geneity condition, it follows it is in 

Finally since the derivative of f(— xz,— yz) interlaces f(— xz ; — yz), and is 
still true when we substitute z = — 1, the interlacing part now follows. It's 
interesting to see that f(x-J^ +y^)g = f (z9 z )g(xz,yz)| z=r 

□ 

Corollary 10.89. Iff G P pos and g = Y. G then 

Xazf(HI)x^GP p d ° s . 
i 

Proof, ocg + x • 9(g) G P^° s by the above proposition, so (x • 5 + oc)(g) G P^ 
for positive a. Now factor f, and use induction. □ 

10.16 Generalized Hadamard products 

A generalized Hadamard product is a mapping of the form 

monomial x monomial i— > constant x monomial 
This generalizes multiplier transformations which have the form 
monomial i— > constant x monomial 
We have seen two Hadamard type products 



Theorem l9T84l x i * x> 
Th.eorem. l9lS7l xVx j 



x l i = j 
ijtj 

i! x 1 i = j 
i^j 



Both of these determine bilinear maps P pos x P — > P. In Proposition ll5.51l 
we will determine sufficient conditions on constants Ct for a product of the 
form 

i i fct* 1 i = j 
x 1 ®x> ^ < 

10 i^j 
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to determine a map P pos x P — ► P. We now study two products for P x P2 
P. Define 



yUxV^jg \^ (10.16.1) 

yVxV-jjJ* 5 j = £ (10.16.2) 

Proposition 10.90. The linear transformations in 1 110.16. l|l and lll0.16.21 determine 
mappings P x p 2 — ► P. 

Proof. Choose f(x,y) = Y.^iMv x m p 2/ and g = Y.o a iV x m P- Since 
2^ an-ty 1 € P, the product 



is in P2. The coefficient of y n in the product equals aifi(x) and is in P. This 
is exactly the product determined by dl0.16.lt . 

The second product is a strengthening of the first since we can apply the 

exponential operator to 010. 16.21 to obtain 1 110.16. lb . Note that ( g(^ )-y l J (0) = 

V.cLi. Since g(-gjj)f is in P2 by Lemma |1 0.621 the conclusion follows from the 
observation that 

3 lf V „, v- ft fx) / , 9 



□ 



The same argument shows that 
Corollary 10.91. If we define 



i! x r i = k 
i^k 



t/ie« f/iis determines a map P x Pd+i — ► Pd- 

Corollary 10.92. Suppose f (x,y) = Y. hi x )v l z 's in Pd+i- The following two sums 
are in P d : 

Proof. We can take limits to see that Corollary 110.911 determines a map P x 
Pd+i — ► Ud- These results now follow if we consider e~ y2 *f and e~ y2 *'f. □ 

The reverse of a polynomial in Pd was defined in (|ll.ll.l) l. 
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Corollary 10.93. Iff = Y_ A My 1 an d g = Y. 9ii x )v l are both in Pd+i(i"0 then 

1. OiMSn-iM G P d 

2. OiMgn*) eP d 

Proof. For the first one we multiply and extract a coefficient as usual. For the 
second, apply (1) to f and g Tev = £_ g[ ev (x)y n -\ □ 

10.17 THE BMV conjecture 

The Bessis-Moussa-Villani (BMV) conjecture, as reformulated by Lieb and 
Seiringer[114|, is 

Conjecture 10.94 (BMV). If A, B are positive definite matrices then 
trace(I + (A + 1 B) n ) has all positive coefficients for all positive integers n. 

We offer a stronger conjecture. 

Conjecture 10.95 (s-BMV). if At, Bt are positive definite matrices, gi e P pos , and 
f(x,yi,...,y e ) e Pi +e then 

1. det[f (x 1+ gi(Aj + Biy) + ■ • • + g e (A e + B e y))] has all positive coefficients. 

2. If we write the determinant as Y_ fiMy 1 then all f j. e pv° s , 

3. In addition we have ■ ■ ■ < — ft < — f t+i < — 

If C is a d x d matrix then the trace of C is the coefficient of x d_1 in the 
polynomial |xl + C|. Replacing x by x + 1 and applying this fact shows that the 
strong BMV conjecture applied to det(xl + I + (A + By) n ) implies the original 
BMV conjecture. 

We can take limits and let the gt belong to P pos . A case of particular inter- 
est is the exponential BMV conjecture: 

Conjecture 10.96 (e-BMV). If Ai, Bi are positive definite matrices, 

1. det[xl + e Al+Biy + • ■ ■ + e Ae+Be y] has all positive coefficients. 

2. If we write the determinant as Y_ fi.(x)-y x then all ft e pp° s . 

3. In addition ■ ■ ■ < — ft < — ft + i < — 

It is not the case that |xl + (A + By) n \ e P 2 or that |xl + e A+By | e P 2 . For 
example, if we take A = B = I then the latter determinant is (x + e 1+v ) d which 
is a polynomial in x, but an exponential polynomial in y. 

The leading coefficient of |xl + gi(Ai + Biy) + h g e (A e + B e y)\ is 1. We 

have partial information about the constant term. 

Lemma 10.97. If A, B are positive definite and g € pv° s i\ ien ^ constant term 
(with respect to x) qfdet[xl + g(A + By)] has all positive coefficients. 
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Proof. The constant term is det[g(A + By)] . If we write g(x) = n( x + a 
where the en are positive then 

det [g ( A + By )] = f\ det [<n I + A + By ] 

This has all positive coefficients since at I + A and B are positive definite. □ 

There is one simple case where s-BMV(l) and e-BMV(l) hold. 

Lemma 10.98. If Ai,Bi,... are commuting positive definite matrices then s- 
BMV(l) and e-BMV(l) hold. 

Proof. Let O simultaneously diagonalize all the A/s and B^'s, so that OAiO* = 
diag(ay) and OBtO* = diag(by) where the ay's and by's are positive. 
Then 

|xl + gi (Ai + Bjy) + • • • + g e (A e + B e y)| = JJ^ + 9i(«H,j + *UiV)) 

i i 

All coefficients are positive since g e P pos , and ay, by > 0. The exponential 
case follows by taking limits. □ 

It is not even clear that s-BMV(2) and s-BMV(3) hold in this simple case. It 
would follow from the following 

Conjecture 10.99. Suppose g e P pos and at,bi are positive and n is a positive 
integer and write 

n 

]^[(x+g(Q 1 + b i y)) ^fiMy 1 . 

i=l 

All UeP vos and f t < — f l+1 . 

This is a special case of the composition conjecture, which we discuss in 
the next section. 

10.18 The composition conjecture 

We investigate some special cases of the composition conjecture 

Conjecture 10.100. Suppose that f (x,yi, . . .,y e ) € Pd+e, and choose gt(y) € 
P pos . If we write 

F(x,y) =f(x,gi(y ),..., g e (y)) =^f i (x)y i - 

then 

1. All ft are in P<j. 
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Remark 10.101. It is not the case that F e Pd+i; take f = x + y and g(y) = 

e y . However, if a e M then we clearly have F(x, a) e P poS d/ since we are 
merely substituting gt(a) for u t . Thus, F(x,y) satisfies substitution for y, not 
substitution for x, and the coefficients of y interlace. 

If f e P pos d+e and the composition conjecture holds then all coefficients 
of F(x,y) are positive. 

We first show if we have one exponential function then the composition 
conjecture holds. The proof is for d = 1, but an identical argument shows it 
for any d. 

Lemma 10.102. Suppose f e Pz, and g e pp° s . If the coefficient h n of y n in 
e -xg(u) i s i n p then the coefficient k n ofy n in f(x, g(y)) is in P. 
IfKv < — h n+ i then k n < — h n+ i. 

Proof. Let T g be the linear transformation 

f i ^ coefficient of y n in f (x, g (y ) ) . 
The generating function is 



G(x,y,u,v) = ^T 9 (x 1 y 



(-u) l (-v) J 



— ^ [Coefficient of y n in x 1 g (y ) j 

X.j . 

= e- ux [Coefficient of y n in e - v ^ ] ] . 

By hypothesis this is in P 3 , so T g maps P2 to P. 

The second part is similar, and considers the map 

T g ,a !-» [coefficient of y n in f (x, g(y))] + a[coefficient of y n+1 in f (x, g(y))] 

□ 

Corollary 10.103. If f e P2 t/ie coefficient k n o/y n m f (x, e y ) is in P. More- 
over, k n < — k n+ i. 

Proo/. Recall the identity 

00 ^ 

where B| is the Bell polynomial. Since the Bell polynomials are in P and con- 
secutive ones interlace, the result follows from the preceding lemma. □ 

The following conjecture arises naturally in the lemma following. 
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Conjecture 10.104. Suppose that oc\, . . . , cte are positive and n is a positive integer. 
The mapping 

x'vl 1 ■ ■ -vl* >-» x 7 (aiti + • • • + a e t e ) n 
determines a map P^ os — > Pd+e- 

Lemma 10.105. Choose positive at and let gi(y) = e Qiy . If Coniecture \10.1 Oil is 
true f/zen t/ie composition conjecture(part 1) holds for these choices. 

Proof. For simplicity of exposition we take e = 2. Write 

f(x,y) = Y_ air S x I y[y| 

Substituting for Qi yields 

F(x,y) = Y_ a.rsx'^'e"* = air S xV Q,r+a2 ^ 

We find the coefficient g n (x) of y n by differentiating n times and letting y — 

9n(x) = Y_ iirsx'tair + a 2 s) n 

and this is in P^ 0& by the conjecture. □ 
Unfortunately consider 



Example 10.106. The operator aiyi ^ + Q2i)2g^ does not necessarily map 
f = (x + y + z + 2)(x + u+2z+l) 



^d+e to i tse lf- Consider 



2 



g =yi v + 2zf z = [y{2y +2x + 3)] + [fix + 9y + 10] z + 8z : 

The constant term of g is two lines that meet at (—3/2, 0). However, the coeffi- 
cient of z is a line that does not pass through this point, so adjacent coefficients 
do not intersect, and hence g ^ P3. 

Remark 10.107. It is interesting to compute the generating function of the linear 
transformation 

T n : xVz s ^ x i {otr+ |3s) n 
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We know it does not define a map P 3 — ► P, but it does appear to satisfy 



PP° S — P. 



G (x, y , z, u, v, w) = 2_ T n (x y z J 



7~ x i (ar+ (3s) 



i!r!s! 

uVw s (-l) i+r+s 
i!r!s! 



-v ' s"- — -w 



Jc7<*- r!s! 

k=0 x 7 r,s 



n\ /v— r k (— v) r N ,t— s n k (— v) 



k=0 v 7 

v— n V / 



-W 

\ k' / 

k=0 

where B k is the Bell polynomial. If a — 6 = 1 then this simplifies to 

which is in P 4/ but it does not appear to be in P 4 if a ^ (3. 
The derivative-substitution class 

It appears that derivatives of compositions satisfy a a much stronger condition 
than interlacing. 

Definition 10.108. A polynomial f(x,y) satisfies derivative-substitution if for 
m = 0, 1, . . . and a G R 

3 m f 

1. x,a G P 

9 m f d m+1 f 
2 - 9u^ (X ' a) 3y^T (X ' a) interlace - 

Clearly polynomials in P 2 satisify derivative-substitution. The next lemma 
gives some of the properties of this class. 

Lemma 10.109. If f(x,y) satisfies derivative-substitution and f (x,y) = Y. hMv 1 
then for m — 0, 1, . . . 

2. f m G P. 

2. f m and f m+ i interlace. 

3. f (x,y + y) satisfies derivative-substitution for all yel. 
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Proof. Differentiating m times with respect to y and setting y = shows part 
1. From the second condition of the definition we have that for all (3 G K 

9 m f 9 m+1 f 

which implies that ft and ft + i interlace. 

The third part follows from the observation that 

3 m f(x,y+Y) , > 8 m f lx,y) , 

(x, a] = — x,a + Y 

dy m 9y m 

□ 

We have our final generalization of the BMV conjecture 
Conjecture 10.110. 

1. Iff[x,y) e P2 and g e P then f (x, g(y)) satisfies derivative-substitution. 

2. If A, B positive definite, and g £ P then xl + g(A + By )| satisfies derivative- 
substitution. 

For the rest of this section we consider F(x,y) — f(x,g(y)). The Faa di 
Bruno formula is an expression for the m'th derivative of a composition. In 
our case we can write 

_f(x, g(y)) = Y_ f' k) K g(y)) A m , k (y) 

" k 

Observe that this is the constant term of 

Cm \ / oo ^ \ 

k=0 / \k=0 ' / 

The right hand term is f (x, z + g (y ) ), and for ael we see that f (x, z + g ( a) ) 
is in P2. Define 

A m (x,y) =^A m , k (x)y k 

In order to show that F satisfies derivative-substitution we only need to show 
that A m (a,y) € P, and that A m (a,y)<A m+ i(a,y). 
We can do a few special cases. 

Lemma 10.111. If f € Pz then f(x, e v ) satisfies derivative-substitution. If d is a 
positive integer then f(x,x d ) satisfies derivative-substitution. 

Proof. These follow from Lemmas II . 771 and 1 1 . 781 □ 
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10.19 Recursive constructions 

In this section we see how to construct new polynomials in P d from old ones. 
We can use these constructions to get infinite sequences of interlacing polyno- 
mials that are analogous to sequences of orthogonal polynomials. 

Lemma 10.112. Let f < g be two polynomials in P d . Choose b € R, and a vector 
a = (ai, . . . , aa) of positive real numbers. With these assumptions we can conclude 
that 

1. (b + ax*)f-g eP d - 

2. (b + ax^f-g^f. 

Proof. Let h = (b + ax t )f — g. Since h + af = (b + a + ax t )f — g it suffices 
to show that he P d . All terms of maximal total degree are obtained from 
f by multiplying by various coordinates of a, so they are all positive. If we 
substitute for all but one variable then we have to show that (a + cx)f — g is 
in P, where f and g are the results for substituting in f and g. Since f <g the 
result follows from Corollary II. 221 □ 

If we iterate the construction of the lemma we can get an infinite sequence 
of interlacing polynomials. 

Corollary 10.113. Choose constants bi, b 2 , ■ . . , positive vectors ai, a2, ■ • ■ , and pos- 
itive constants c\, C2, Set f_i = 0, fo = 1 and define f n recursively by 

fn = (b n + a^X^fn-l - C n f n _ 2 - 

All i n are in P d , and we have interlacings 

f >fi>f 2 > 

Since the derivative preserves interlacing, we can use derivatives to take 
the place of f n -2- 

Corollary 10.114. Choose constants bi, b2, . . . , positive vectors ai, &2, ■ ■ ■ , and pos- 
itive vectors c\, C2, Set f o = 1 and define f n recursively by 

f n = (b n + anX^fn-i - c n (—f n -i 

\ox 

All f n are in Pd, and we have interlacings 

f >fi>f 2 > 

Example 10.115. As an interesting special case, consider the recurrence fo 
and 

9 9 

fn+l = Xf n - 9xf n = (Xi H h X d )f n - (- 1 h )f n 

OXi ox d 
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If we define one variable polynomials ho = 1 and h n = xh n _i — dh^_j then 
it is easy to verify that f n = h n (xi + • • • + Xd). The h n are orthogonal polyno- 
mials (essentially modified Hermite polynomials), and so we can view these 
f n as generalized Hermite polynomials. In § IHI13.5l we will see a true gener- 
alization of Hermite polynomials to n-variables. 

The linear combination (b + ax t )g + f also has interlacing properties. This 
generalizes Corollary II. 141 

Lemma 10.116. If f -C Q in P d and a is positive then (b + ax t )g + f <. g. 

Proof. By linearity it suffices to show that the left hand side is in P d . The 
homogeneous part is ( ax 1 ) g H , the degrees are correct, and substitution follows 
from Corollary ll.141 □ 



10.20 Two by two matrices preserving interlacing in 

In this section we generalize results from Chapter[3]to Pa- Many of the proper- 
ties of one variable have immediate generalizations to d variables. However, 
we don't have a concept of "mutually interlacing" for more than one variable, 
so Theorems such as 13. 71 are only possible for two by two matrices. 

The following Lemma generalizes Corollary 13.81 The restrictive assump- 
tions arise from the necessity to control the homogeneous part. 

Lemma 10.117. Suppose that fi <c f2 (or fj < ij) in Pd have positive leading co- 
efficients, and ( " d) ( f2 J = ( tl )• a / b, c, d are positive and if the determinant 
ad — be is positive then gi <c g2, and if it is negative §2 <C gi. 

Proof. The positivity assumptions imply that gi, g2 have positive homoge- 
neous part. Corollary 13.81 implies that gi,g2 € Pd/ an d that the interfacings 
hold as stated. □ 

As long as all coefficients are positive then the matrices generalizing Corol- 
lary [334] preserve interlacing. 

Lemma 10.118. Suppose that a, |3, a, b are vectors with all positive coordinates. The 
following matrices preserve interlacing in P d : 

foe a • x\ /a-x \ fa-x — <*\ /a-x —°<-\ 
\0 J \ a b-x) \ b-xj ^ (3 b ■ x) 

Proof. All polynomials occurring in a product have positive homogeneous 
part. Substitution and interlacing follow from the one variable case. □ 

Next, we generalize Proposition [336] 

Lemma 10.119. Suppose that M — ( £ 2) is a matrix of polynomials with in P d . M 
preserves interlacing if 
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1. g — ► f, k — ► h, and both interfacings are strict. 

2. The determinant I * , 9 I is never zero. 

I h. k I 

Proof. If M ( I ) — ( ^ ) then the homogeneous parts of u and v are positive. 
Interlacing and substitution follow from the one variable case. □ 

How can we find such matrices? Suppose that f G Pip". Recall that this 
means that the graph of f has no intersection points. Choose g € Pf en so that 
f < g. For instance, we could take g = |£. Define 

M= ( af ig 

V 9x 9x 

It follows from Lemma [l .551 that the determinant of M is never zero. 
10.21 Polynomials from matrices 

We can construct polynomials in several variables from symmetric matrices, 
and with some simple assumptions the resulting polynomial is in Pd- We 
begin with a construction for a polynomial in d variables that generalizes 
LemmaEZS 

Lemma 10.120. If C is symmetric and A 2 , . . . , Ad are pairwise commuting positive 
definite symmetric matrices, then 

f (x) = det( Xl I + x 2 A 2 + • • ■ + x d A d + C) 

is in Pd- 

Proof. Since A2, . . . , Ad are pairwise commuting and symmetric, they are si- 
multaneously diagonalizable. There is an orthogonal matrix P such that 
PAicP 1 = Dk, where = (e^) is a diagonal matrix with all positive entries 
on the diagonal. We note 



f (x) = det(xi I + x 2 A 2 + . . . x d A d + C) 

= dettP 1 ) det(xj I + x 2 D 2 + • • • + x d D d + PCP 1 ) det(P) 

As before this satisfies xi -substitution since PCP 1 is symmetric, and the ho- 
mogeneous part is 



f H (x) = det(xj I + x 2 D 2 + ■ • • + x d D d 

= H (xi + X2 e kk + ' ' ' + *d ekki- 
k 



which has all positive coefficients. □ 
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Remark 10.121. A variant of the lemma is to assume that 
f (x) = det(xiDi + • • • + x d D d + C) 
where C is symmetric, all are diagonal, and Dj has all positive diagonal 
entries. If ^/d^ 1 is the diagonal matrix whose entries are the inverses of the 
positive square roots of the diagonal entries of Di, then 

f(x) = det(Di) det (x x l + xzDzDf 1 + ■ • • + XdDdDf 1 + y / D^Cy / Df^ 

We can now apply Lemma [l . 1 201 to see that f (x) £ Pd- 

How can we find such commuting families of positive definite matrices? If 
f is a polynomial, and A is an eigenvalue of a positive definite matrix A, then 
f(A) is an eigenvalue of f (A). In particular, if f £ P pos then f (A) is positive 
definite. Since e x is a limit of polynomials in P pos and any two polynomials 
in A commute, we have 

Corollary 10.122. Suppose that f, f 2 , . . . , f a are polynomials in P v ° s , A is positive 
definite and a, ol 2 , ■ ■ ■ , a-d are positive. The following polynomials are in P d . 

det(xil + x 2 f 2 (A) + • • • + x d f d (A) + f (A)) 

detfxjl + x 2 e CClA + ■■■ + x d e adA + e aA ) 

The polynomials that arise from these constructions can interlace. 

Lemma 10.123. If D is a diagonal matrix with all positive diagonal entries, C is 
symmetric, and P is a principle submatrix ofxl + yD + C, then 

|xl + yD + C|<|P| in P 2 



Proof. We may assume P is the result of deleting the first row and first column 
of |xl + yD + C|. If D = diag(di), C = (ctj), v = (ci 2 , . . ., Ci n ) then for any 
aeK we can write 



g«(x,uj 



x + diy + en 



a 



diy 



V 




a 





P 


+ 





P 



HxI + yD + C| + a|P| 

Since g a G P 2 for any ae lit follows that the desired polynomials interlace. 

□ 

Remark 10.124. If we take y — then we recover the classical result that prin- 
ciple submatrices interlace. A similar result holds for P d - See Theorem ! 1.61 1 

If C is the all zero matrix and y — 1 then f (x) = |xl + uD + C = Yl( x + <U) 
and |P| = J . These polynomials are the ones occurring in quantitative sign 
interlacing. 
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10.22 Extending three interlacing polynomials 

Consider a polynomial in P3 



v 

foiM 



v 

foo(x) 



¥ 

fu(x) 



fioM < 

If we are only given foo<. "foi/fio then we show that we can add terms to 
foo + yfio + zfoi so that the resulting polynomial F is in P3. (See p. 12911 ) We say 
that F is an extension of foo, f 10, f 01 ■ Given one extension, we show how to find 
many more with the same initial data. 

The term fn is not arbitrary since it interlaces foi and fio, and satisfies 
foif 10 — foofii ^ 0. We show that if f o has degree two then fiofoi — fooW ^ if 
and only if there is an extension of foo, fio, foi such that W = f n. We conjecture 
that this is true in general (Ouestion ll41l l. 



Lemma 10.125. Suppose that f < g and f < h, where f, g, h. have positive leading 
coefficients. Then there are F e P3 such that if we write F(x,y,z) = Y. fij( x )y lz ' 
then 

foo = f, f 10 = g, foi = h 
Proof. If we take three sequences at, bi, C\, a constant a, and form the product 



F = + di* + bty + Cj.z) 



then 



Poo 
Fio 

Foi 



a]^[(l + Qix) 

' — 1 + QiX 

L Fqo 
Ci l + ai x 



Given f, g, h we can, following the argument of Lemma [9. 1031 find cc and the 
sequences a i; bi, Ci so that F o = f, Fi = g, F i = h. □ 

Remark 10.126. We can use a determinant identity to derive a different exten- 
sion of three polynomials. Recall (|9.1.31 . If we let D = diag(di), f = |I + xD|, 
V = (Vt] and W = (Wt) then 



|I + xD + uVV 1 + zWW l \ = 



(1 +xd t )(l +xdj; 



v t V, 
Wt Wj 
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Since D, VV 1 , WW' are positive semi-definite, the right hand side is in P3. (See 
Lemma [11. 381 ) It follows that we have an interlacing square 

f x w 1, 2 



f 



xdi)(l + xdj 



Vi Vj 



Y 

f 



^- 1 1 + diX 

There are many determinants that will give the same initial data. This 
follows from the more general result that shows that the initial data only con- 
strains the diagonals. 

Lemma 10.127. Suppose that D is a diagonal matrix, Di and Ei have the same 
diagonal, as do D2 and E2. If we write 

|I + xD+uD!+zD 2 | =^Utj(x)yV 

|l + xD+yEi+zE 2 | =^Vy(x)yV 
then U 00 = V 00 , Ui = V 10; U i = V i 
Proof. Clearly U o = V o- Let Dj = (dy) and Ej = (ey). Then 

U10 = ^(i + xD+yDj +0D 2 | y=Q 



= ^d ij |(I + xD)[i,j]| 

y 

= du J (I + X D ) [i, i] J since I + xD is diagonal 

i 

— V" eu I (I + xD ) [i, i] J since en — da 

i 

= Vio 



□ 



If we are given foe < — f 10/foi in P pos , then by the first lemma we can 
realize f 00/ f w, foi a s the coefficients of |I + xD + yDi + ZD2I where D,Di,D2 
are positive definite diagonal matrices. If we can find symmetric matrices 
Si, S2 with zero diagonal such that Di + Si and D 2 + S2 are positive definite, 
then f 00, f 10/ f 01 ar e also realized as initial coefficients of 1 1 + xD +-y (Di + Si) + 
z(D 2 + S 2 )|. 

Example 10.128. Consider the case f o G P pos (2). Given the initial data 
f 00 < f 10/ f 01 we first determine all a that arise from Lemma llO.1271 Consider 



g(B,C) = 



1 
1 



CHAPTER 10. POLYNOMIALS IN SEVERAL VARIABLES 



363 



where we choose qi, a 2 , b\, b 2 , c\, c 2 so that 

g(0,0) = foo + yfio + zfoi + <*yz. 
This representation is possible by Lemma [10.1251 Recall 

If r, s are positive then | [ * | is positive definite if and only if rs > t 2 . 

Thus, g(B, C) G P 3 (2) if and only if bjb 2 > B 2 and CiC 2 > C 2 . Now a is the 
coefficient of yz which is 

bic 2 + b 2 C! +2BC. 
Using the constraints above, we find 

(v 7 ^- V^if < (v / bTc^+ \f^cif (10.22.1) 

All foo, foi/ f 10 an d a satisfying this inequality have extensions to P3. 

Conversely, assume that foo, fio/foi are determined by g(0,0). If we solve 
the inequality 

fiofoi - af 00 > 

for a we get 1110.22. It . 
10.23 Subsets of P d 

In one variable we had two different ways to talk about P pos . We could de- 
scribe it in terms of coefficients (all the same sign), or in terms of roots (all 
negative). However, P' — 1,1 ^ has no definition in terms of coefficients, but is 
given only in terms of the location of its roots. In higher dimensions the graph 
corresponds to zeros in one dimension. We define 

Definition 10.129. If K c E d then 

P^ = {f G P d I the graph of f is disjoint from K} 
= {f GP d |f(x)=0 x^K} 

Consider some examples and properties. 

1. If K = then P^ = Pd, since there are no restrictions. 

2. IfKc K'thenP*' c P£. 

3. Let v, w be two points of and let 

K = {x G R d I x > w} U {x G R d I x sC v} 

Every solution variety of P^ must meet the segment vW. This is an ana- 
log of P^ 1 ' 1 '. 



CHAPTER 10. POLYNOMIALS IN SEVERAL VARIABLES 



364 



4. Let K = {{%,%,. ..,%) | x e 1}. Since every f e P d satisfies f (x . . . , x) G P, 
it follows = 0. 

5. Suppose K = jx e K 2 | x ^ Oorx ^ 0}\(0,0). Then, Pf consists of 
polynomials whose graphs are straight lines passing through the origin. 

6. Suppose that K = {x g K n | x > 0}. We will show that P| = P 1 * 35 . 
Since all coefficients of P^ os are positive, it follows that f e P^[ os has no 
root in K. We prove the converse by induction on d. For d = 1 we 
know that all the coefficients are the same sign if and only if no root is 
non-negative. Suppose that f e P d has no root x with x ^ 0. Write 
f (x) = Y. fi( x i/ ■ • • / x d-i)x d . If we substitute x d = then we see that 
f o (xi, . . . , Xd-i ) has no root with all coordinates non-negative, so by in- 
duction all coefficients of fo(xi,. . .,x d _i) have the same sign. Since all 
coefficients of f H are positive, all coefficients of fo(xi, . . . ,x d o-i) are pos- 
itive. 

Next, if f g P§, then r£L g P^ since the roots of lie between roots 
of f. Consequently, we conclude as above that fi(xi,. . . ,x d _i) has all 
positive coefficients. Continuing, we see f has all positive coefficients. 

Many different K can determine the same set of polynomials. For example, 
the set K = {x g M d | x > 0} and K' = K \ p where p > satisfy P£ = Pjf . 
However, if we choose p to be the origin, then P^ C P* since xi + • • • + x d is 
in P^ but is not in P^. We define 

Definition 10.130. 

K = |J{LcM d l p d =Pd} 

In some cases we can determine K. 

Lemma 10.131. Suppose that K is a closed convex region ofR d whose supporting 
hyperplanes have an equation in P d . Then, K — K. 

Proof. Since K C K it suffices that any point v not in K is not in K. Pick a 
supporting hyperplane H such that K is on one side of it, and v is in the interior 
of the other side. Choose a hyperplane H' parallel to H, and containing v. It 
follows that v lies on the graph of a polynomial that is disjoint from K, and 
hence v ^ K. □ 

Remark 10.132. In case d = 2 we see that if K is a closed convex region whose 
boundary is a non-increasing curve then K = K. 

We now ask when P^ is closed under differentiation. We say a set S is 
convex in the i-th coordinate if whenever v, w agree in all coordinates except 
the i-th, then the segment vw is in S. 

Lemma 10.133. IfR d \K is convex in the i-th coordinate then P^ is closed under 

3xi 
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Proof. To show that -M^ e P*[ we fix all but the i-th coordinate and consider 
the roots of the resulting polynomial. Since the roots of lie between the 
roots of f the convexity in the i-coordinate imply that they lie in IR d \ K. □ 

10.24 Higher dimensional recurrences 

In this section we investigate polynomials defined by two and three dimen- 
sional recurrences. Some of these polynomials are in one variable, and some 
in two. One of the interesting features is that we do not always have that con- 
secutive terms interlace. We begin by considering polynomials fy satisfying 
the similar recurrences 



fy = (x - oOft-jj + bijfy_i (10.24.1) 

fy = (x- (kjKi-u + bijfy-a (10.24.2) 

fy = (x - a0f t _y + btfi_y_i (10.24.3) 

We can visualize these recursions using lattice diagrams: 



X-Qj , 

Ti-i,5 H,j 



r aij 



btj 



fy-i f t,j i fi-y-i 

We will see that recurrence 1110.24. Il l is a disguised product, that recurrence 
1110.24. 2b fails unless the coefficients satisfy certain restrictions, and recurrence 
jl0.24.3b comes from coefficients of products in P2. 

Lemma 10.134. Suppose fo,o = 1 and f satisfies recurrence 1110.24. lb . Then fy is a 
constant multiple of nlc=i( x — Q k)- ^ n particular, all fy are in P, and consecutive 
fy interlace. 

Proof. Assume by induction of i, ) that 



fr,s = <X. r ,s Y[( X 



akj 



k-1 



for all (r, s) < (i, j). We then have 



fy = (X - ai)fi_y + byfy_i 



i-1 



= (x- ai)ai_y Qk) +byay_i]^[(x- a k ) 

1 1 

i 

= + byay_i)]^J(x- ak ^ 
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□ 

Here is a simple recurrence whose proof that all members are in P only 
depends on the additivity of interlacing. 

Lemma 10.135. Let h n = nit* + a 0/ an ^ < — 9n where all g n have 
positive leading coefficients. If we define f o = 1 and 

fn = (x+ a u )f n _i + g n 

then hn, <C fn .In particular, all ft have all real roots. 

Proof. We assume by induction that h n _i f n -i Multiplying each side by by 
(x — a n ) yields h. n <c (x — a n )f n _i. Since we also have h n < — g n we can add 
these interfacings to conclude that 

h n < (x- a n )f n _i + g n = f n 

□ 

We can apply this lemma to construct a particular sequence of polynomials 
whose members all have all real roots, but consecutive ones do not generally 
interlace. 

Corollary 10.136. If fo = 1 and f n satisfies the recursion 

n-1 

f n = (x + n)f n _j + (x + n-2)]^[(x + i) 

l 

then all f n are in P. 

Proof. Take h n = Ylii x + V)* an d 9n = ( x + n — 2)h n _!. □ 

Lemma 10.137. Suppose that fo,o = 1 and fy satisfies 410.24. 31 where all cu have 
the same sign. Then all fy are m P. 

Proof. We claim that the fy are the coefficients of the products 

i i 

ntx + bju-aj) =£fy(x)y J 

j=l j=0 

This is certainly true when i = 0. We only need to check that if we write 



i i 

Y[{x + bj-y - Qj ) = Y_ 9y (x) y ] 
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then the gy satisfy the same recurrences as do the fy: 



i-1 



J^[(x + bjU - dj) = (x + bi-y - a.i)Y_ 9i-yWl/' 
1 j=0 

and so we are done by induction. □ 

We can generalize this argument to products in more variables - this leads 
to polynomials in x with more indices. For example, consider the recurrence 

fy,icM = (x - a i )f i _ U/k + b t fy_i /k + c i f i/j/lc _ 1 (10.24.4) 

Lemma 10.138. If fo,o,o = 1 fy /k satisfies 1)10, 24.41 , a/Z bi foroe f/ze same sz'gn, 
and aZZ ct /zaae z7ze same sz'gzz C/zezz aZZ fy, k are m P. 

Proof. Consider the product below, and proceed as in Lemma [l0.137l 

i 

JJ(x-Q h + b h U + C H z) =^fy /k (x)u ) Z k 



H=l 



□ 



As another product example, consider the recurrence 



fy = (chx + bOfy-i + (ctx + dix)U-iA (10.24.5) 
whose diagram is 

Ti-i,j Ty 

dix+bi 

Lemma 10.139. //fo,o = 1/ ft, j satisfies fll0.24.5l l, and cudi - Cibi > 0/or aZZ i 
iTzen fy S P. 

Proof. The assumptions imply that all the factors in the product below are in 
P2, so we can proceed as above 

nUdjX + bj) + ( Cj +djxhj) = ^fy(x)y j 

□ 
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In fact, we can take any polynomial in P2 that has some free parameters, 
take their product, and the coefficients fy of yi will satisfy a recurrence, and 
all the fy belong to P, and consequently will interlace. 

Here is another disguised product 

fy = (x-bOfi-y + (y - d,-)fy-i (10.24.6) 

with diagram 

. x-bi . 
Ti— 1,3 f i,) 

The proof is a simple modification of the proof of Lemma HQ. 1341 

Lemma 10.140. Suppose that fo,o = 1 and fy satisfies (|10.24.6b . TTzen fy is a 
constant multiple of TT^^x — ~by) ■ rih.=i(y — ^h)- 

We can get many recurrences from products. Here is a general approach. 

Let 

g[x,y,z) = Y_ 9r(x,z)y r 
where g G P3, choose tj e E, and consider the product 

i 

]^[g(x,y,tj) = ^fy(x)y 3 

5=1 y 

We know that all fy e P, and consecutive ones interlace. The recurrence is 
easily found: 

i+1 

]^[g(x,u,tj) = g(x,y,t i+1 ) ^fi, s (x)u s 

5=1 i,s 

= (X9rfct i+ i)yj X fi - s(x ^ S 

i,s 

and comparing coefficients of yields 

fi+y = Y- 9r(x,t i+1 )f iiS (x) 

r+s=j 



Chapter 



The polynomial closure of 



In this chapter we study the polynomial closure of Pd- For polynomials in one 
variable we know that P(n) is exactly the set of polynomials in the closure 
of P(n), so the closure of P leads to analytic functions. However, there are 
polynomials in the closure of Pd(n.) that are not in Pd(n), so we have two clo- 
sures to consider: new polynomials and analytic functions. As expected, most 
properties of Pa extend to these closures. We consider the analytic closure in 
the next chapter. 

11.1 The polynomial closure of P 2 

In this section we investigate properties of the polynomial closure of P2. We 
begin with the general definitions. 

Definition 11.1. Pd(n) is the set of all polynomials that are limits of polyno- 
mials in Pd(n). P^° b (n) is defined similarly. We set 

P d =P d (l)UP d (2)U.... 

We say that f, g G Pa interlace if f + ocg S Pd for all real a. 

Although most properties of P2 extend to P2, there are some slight differ- 
ences. The polynomial x n y m is in P2 since 

lim (x + ey) n (y + ex) m = x n y m 

and (x + ey) n (y + ex) m is in P2. This shows that there are polynomials in 
P2 where the x-degree is n, the y -degree is m, and the total degree is n + 
m. Consider this example that shows how the coefficients can interlace in 
complicated ways. 
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Example 11.2. The definition of f (x,y ) as a limit of polynomials in P 2 
f(x,y) = lim (ex + y -l)(ex + y -2)(ex + y +3){x+y + l)(x + 2y + l) 
= (y-l)(y-2)(y + 3)(x + y + l)(x + 2y + l) 

shows that f e P 2 . If we expand f we get 

= (6 + 12x + 6x 2 ) + (11 + 4x - 7x 2 )y + (-9 - 21x)y 2 + 

(-13 + 2x + x 2 )y 3 + (3 + 3x)y 4 + 2y 5 
= f + f lV + f 2 y 2 + f 3 y 3 + f 4 y 4 + f 5 y 5 

These coefficients interlace in ways that are not possible for polynomials in P 2 , 
namely 

f >fl<f 2 >f3<f4. 

We can use the idea of this construction to show that if g(x) e P then 
g(x] e P 2 . Simply note that g(x + ey + e) is in P 2 and converges to g(x) as 
positive e goes to 0. Similarly, if we let e go to zero through positive values 
we see that 

ppos c pP° S 

The mapping f x g 1— > f (x)g(y) defines a map P x P — ► P 2 , and 

ppos x ppos > pP°s_ 

The two polynomials f = x n y m and g = x n_1 y m_1 do not interlace. How- 
ever, they do satisfy f ~ g in the sense that f + ag e P for all positive a. 

In order to see that polynomials in P 2 interlace we can express them as 
limits of interlacings. For instance, the interlacing xy — 1 <x + y follows from 



(xy — 1) + a(x + y) = lim 



ex + y + a vl + a 2 
+ a 2 x+ey + a 



Example 11.3. Unlike in P 2 , the signs of a polynomial in P 2 can be all nega- 
tive. This follows from the observation that if f 6 P 2 then — f 6 P 2 . This is a 
consequence of the limit 

lim (ex + ey — l)f = — f 

If f G P2(n) then we define the homogeneous part f H to be all terms of 
total degree n. Unlike P 2 , the example above shows that all the coefficients of 
the homogeneous part can be negative. There are no other possibilities - see 
Lemma lll.151 

We can apply Lemma [9.32l to deduce 

Lemma 11.4. Iff e P2(tl) theni(x,x) a«dx Tl f(x,— 1/x) are in P. 
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This shows that xy + 1 isn't in P2 even though it satisfies substitution. The 
graphs of polynomials in P2 are more complicated than those in P2. Consider 

f = (x + 4y + xy + 1) (x + y + 3xy + 2) (x + 5y + .5) (x + 2y + 1). 

The graph of f in Figure Hi .11 has two vertical asymptotes at x = —4 and x = 
—3. None the less, every vertical line except these two meets the graph in 
exactly four points. 



Figure 11.1: The graph of of a polynomial in P2 



We can use determinants to realize members of P2. 

Lemma 11.5. Suppose that C is a symmetric matrix, and D is a diagonal matrix 
where the i-th diagonal entry is duxi + dt2X2 where dtj is non-negative. The poly- 
nomial det(C + D) is in P2- 

Proof. Choose e > and let D e have diagonal entries ^(dtj + e)xt. A argu- 
ment similar to Lemma [10.1201 shows that det(C + D e ) € P2- Taking limits 
gives the result. □ 

We now find some simple polynomials in P2 that are not in P2. 

Lemma 11.6. J/f(x) e P pos then1{-xy) e ±P 2 . 

Proof. Since f is a product of linear factors, it suffices to show that — xy + a 2 € 
P2. The determinant 

x + ey a 
a y + ex 

is in P2, and converges to xy — a 2 . □ 

Corollary 11.7. Suppose a > 0. axy + bx + cy + d is in ±P2 iff \ £ \ \ ^0. 

Proof. Let f(x,u) = axy + bx + cy + d. Since (l/a)f(x — c/a,y — b/a) = 
xy — (be — ad)/a 2 we can scale and apply Lemma [11. 61 □ 
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The lemma can be derived from Theorem ll0.75l Also see Ouestion ll45l It 
is also a special case of Lemma [ll.28l The necessity is the rhombus inequality 
(Froposition llO.411 . 

Lemma 11.8. Ifxy + bx + cy + d S P2 then there are a, |3,y such that 

a + x (3 
P Y + V 

Proof. If we choose y = b and a = c then we find that (3 2 = be — d. This 
expression is positive by the previous lemma. □ 

Example 11.9. Since | "J 1 y 3 _ 2 \ = -(2x + 11) + y(x + 1) S P 2 we see that a 
multiaffine polynomial f + y g e P2 can have f , g with different leading signs. 
Of course we still have f<g. 

Example 11.10. There are simple polynomials that are not in P2. For instance, 
x 2 ±y is not in P2. Indeed, f (x,y ) = x 2 — y doesn't satisfy substitution because 
f (x, —1) ^ P. Similarly x 2 + y doesn't satisfy substitution. 

We can extend Lemma [ll.6l by replacing x by xy + bx + cy + d instead of 
by xy. The next result expresses this extension, but uses a more convenient 
replacement. 

Corollary 11.11. Iff e P then the polynomial f ((x + ct)(y + |3) + y) is in P 2 if all 

roots off are either greater than y, or all roots off are less than y. 

If f € P2° s then f(x, 0) and f(0, x) have all positive coefficients, and all 
negative roots. Thus the graph of f meets the x and y axis in negative values. 
If we intersect the graph with the line x = y then the intersection points are 
all negative. With this in mind, we establish 

Lemma 11.12. Iff e ¥ 2 ° s then f (-x 2 , -x 2 ) e P. 

Proof. By continuity of roots it suffices to assume that f € P^" 3 - The roots of 
f (— x, — x) are all positive, so we can take their square roots. □ 

This is a special case of Lemma Tl 1.1051 

Remark 11.13. If f (x,y ) satisfies x substitution, and f H is nearly homogeneous, 
f is not necessarily in P2. For instance, if f (x,y ) = 1 + x(x + y ) then f satisfies 
x substitution, and the homogeneous part has the form 

x x (homogeneous polynomial with positive coefficients). 

However, f ^ P2 since f(x, x) ^ P. 

Here's a simple fact whose proof requires several closure properties of P2. 

Lemma 11.14. If ax + by + c S P2 then a and b have the same sign. 
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Proof. After scaling and translating, it suffices to show that that x — y is not in 
P 2 . If it were the case that x — y S P 2 then we also know that x — y + 3 € P 2 . 
Since P 2 is closed under multiplication and differentiation, it follows that 

g 

f(x,y) = T-(*-v)(x-y + 3)(x + y) = 6x + 3x 2 -2xy -y 2 e P 2 
ox 

However, f (— l,y) — — 3 + 2y — y 2 has two complex roots. 

□ 

The homogeneous part of a polynomial in P 2 is not too different from the 
homogeneous part of a polynomial in P 2 . 

Lemma 11.15. If f e P 2 t/ien aZZ non-zero coefficients of f H /zaz?e f/ie same sign. 
There are non-negative integers n, m and g € P pos so that f H = ±x n y m G where G 
is f/ze homogenization o/g. 

Proof. If f H only has one term then the result is true. If not all coefficients are 
the same sign then we can write f H = • • • + rx a y n ~ a + sx b y n ~ b + • • • where 
a < b, rs < 0, and all coefficients of x r y n ~ r are zero for a < r < b. Let 
g = (d x ) a (3 y ) n - b f so that g H (x,y) = cy b ~ Q + dx b ~ a where cd < 0. 

Substituting y = 1 shows that b — a is either lor2. If b — a = 1 then 
g = cy + dx + e, but this is not possible by the previous lemma. If b — a = 2 
then we can write g = ax 2 + |3xy + yy 2 + • • • where a and y have opposite 
signs. After scaling x and y and perhaps multiplying by —1 we can assume 
that g = x 2 + |3xy — y 2 + • • • . If |3 > then d y g = |3x — 2y + (constant) which 
is impossible by the lemma. If |3 < then 3 x g = 2x + |3y + (constant) which 
is also impossible. 

The second statement follows immediately from the first since f H £ P. □ 

It is a consequence of the lemma that the polynomial f H (x, 1) has all non- 
positive roots if f e P 2 . 

If we take a product of linear factors where the coefficients of x and y don't 
all have the same sign, then the resulting product isn't in P 2 . However, if the 
constant terms are all the same sign then the coefficients are in P. 

Corollary 11.16. Suppose that b\ e R, and all di have the same sign. If we write 

n 

nty+bix+dt) ^f.My 1 

i=i 

then 

(1) ft(x) e Pfori = 0,...,n-l. 

(2) fi(x) and xfi+i(x) interlace, for i = 0, . . .,n — 2. 
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Proof. By switching signs of x and y if necessary we may assume that all the 
di are negative. Replace y by xy: 

TL 

Y[(xy + bix + di) = Y_ ftMxV 



Since the determinant of each factor is 



1 bi 
di 



dt which is negative, we can 

apply Corollary II 1.71 to conclude that the product is in P2. Consequently, ad- 
jacent coefficients interlace, which finishes the proof. □ 

If the roots of a polynomial are all positive or all negative, then the coeffi- 
cients are all non-zero. We use this simple idea to constrain where the roots of 
coefficients lie. 

Corollary 11.17. Suppose that f, g e P v ° s and s < r. If we define 
f(xy -ry)g(xy - sy) = ^~ hi(x)y l 

then hi (x) e P |s ' r] . 

Proof. We can apply Corollary 111.71 to see that fu(x) e P. If a > r then all 
coefficients of f((a — r)y) and g((a— s)y) are positive. Thus, all coefficients of 
(f [xy ~vy)g(xy — sy ) ) [oi,y ) are positive, and hence fit ( a) is not zero. 

If a < s then the roots of f ((a — rhj) and g((a — s)y) are positive. Again, 
all coefficients of (f(xy — ry)g(xu — sy))(<x,y) are non-zero, and hence ht(a) 
is not zero. □ 

If we define fo(x) = Jo [2^/— x), where Jo is the Bessel function, then we 
have the identity [89. (2.1.A)]: 

f (xy +y)f (xy -y) = f P n W^^ (11.1.1) 
t-o n!n! 

where P n M is the Legendre polynomial. Since fo has all positive coeffi- 
cients, it follows that the Legendre polynomials have roots in [—1,1], as is 
well known. We will also use this identity to determine mapping properties 
of a Legendre transformation - see Lemma [15.48l 

Any polynomial in P determines a mutually interlacing sequence (see Ex- 
ample l3~39|l . The following lemma shows that we can find a polynomial in P2 
that interpolates two of these mutually interlacing sequences. We will show 
that given polynomials f(x),g(x) with roots {Tv},{st}, we can find F(x,y) in 
P2(tl) that satisfies 

F(x,sO = g'(si)f n+1 _i(x) 

Hn,y) = f'(Ti) gn+i-i(y) 
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Lemma 11.18. Suppose that f(x) = nt x ~ 9M = E[( x ~ s i.) where\\ ^ • • • < 
r n and si ^ ■ ■ ■ ^ s n - Then 

Hx,y) = > e P2 (11.1.2) 

^■x-n y- s n+ i_i 

G x,y => — ■ ■ e p 2 (11.1.3) 

■^x + y-Ti x + y - s n+ i_i 

Proof. If we replace f (x,y) by f (x, ey ) and g(x,y) by g(ex,y ) where e is posi- 
tive, then ill. 1.3^ implies that 

y f(x + ey) g(y + ex) 
x + ey - Ti ex + y - s n+ i_ t 

and letting e go to zero establishes l |11.1.2t . The homogeneous part of G(x,y) 
is n(x + y) 2n . To check substitution, we choose a G R and let f a (x) = f(x+ a), 
9aW = 9(x + a). The result now follows from 13 .161 □ 

If Y_ hMv x € P 2 then ft and fi+i interlace. This is half true for ft and f\+2- 

Lemma 11.19. IfY. UMv 1 € P 2 tfcen ft £ -f i+2 . 
Proof. Since y 2 — oc e P 2 for positive cc, the product 

(y 2 - a) Y_ tiMv' = 5L(h+2(x) - afi(x))y l 

is in P 2 . Thus fi+ 2 (x) — afi(x) € P for all positive a. □ 

A Hadamard product with a polynomial in P^ is effectively one with a 
polynomial in P pos . 

Lemma 11.20. Iff = A (x)y l e Pf s tes ffre property that gtMfifxJy 1 € 
P 2 OS / or flH L 9iM ui G Pf s ften f = F(x) • aty 1 latere F(x) e P pos anrf 

2_ a H!(n-i)! fc 1 

Proof. If we let g = (y + l) n then 

^W^QfiM^ePr 



Applying f again yields 

Since adjacent coefficients interlace, Lemma [T .341 implies that ft = F(x)dt for 
some constant Qi. Thus 

Z(-)W = n!F(x)^a il ^ePr 
which proves the theorem. □ 
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11.2 Reversal and cancellation in P? 



In this section we first consider two properties that are trivial for polynomials 
in one variable. First, the reversal of a polynomial in P is in P, and second, if 
xf 6 P then f € P. Neither is immediate in P2, and we do not know the latter 
to be true if there are more than two variables. Finally, we discuss differential 
operators that preserve P d . 

The key to proving that xy — 1 e P was the identity 



xy - 1 = 



x 1 
1 V 



which can be modified to give a sequence of polynomials in P2 converging to 
xy — 1. Now xy — 1 is the reversal of x + y; we have a similar matrix argument 
that shows that the reversal is in P2. If we use 1 /x instead of — 1/x (See p. 12701 ) 
we get stable polynomials (See p. 16201 ). The key is the following well known 
matrix identity for the determinant of a partitioned matrix: 



An 
A 2 i 



A12 
A 22 



I An I |A 2 2-A 2 iA 11 1 A 12 | 
It follows easily that for matrices A, B of the same size 



(11.2.1) 



|xB - A 



xl A 
A B 



Lemma 11.21. I/f(x,y) g Pi{n), then y n f(x, — 1/y) G P2. Equivalently, if 'f = 
^fiMy 1 g P 2 [n), then^hWi-y)^ g P~2 

Proof. If we write f (x,y ) = |I + xDi + yD2| where Di, D2 are positive definite 
then 



x n f(-l/x,y) 



xl — Di + xyD2 
x(I + yD 2 )-Di 

xl \/Di 
/ Di I + yD 2 




I 



I 






/TJT 







This represents f as a determinant of a matrix of size 2n that is clearly the limit 
of determinants of matrices that are in P 2 . □ 

Remark 11.22. Here is an alternative geometric argument. Define 

fe(*,y) = {V + ex) n f(x+ ey,-l/(y + ex)), 

where e is positive. Since lim e ^ofe = f, we need to show that f e g P2(2n). 
The homogeneous part of f e is clearly (x + ey) n (y + ex) n , and so all coeffi- 
cients of fg are positive. If we fix y, then the locus of {(x+ ey, — l/(y + ex))} as 
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x varies consists of two hyperbolas that open in the second and fourth quad- 
rants. Consequently, every solution curve of f meets each of these hyperbolas 
- see Figure [9761 . Thus, for any y, there are 2n solutions, and substitution is 
satisfied. 

Proposition 11.23. The two variable reversal of a polynomial in Piin) is in P2(2n). 

The two variable reversal ofi is f REV = x ri u n f(l/x, 1/y). 

Proof. If we apply the lemma twice weseethatx n y n f(— 1/x, — 1/y) G P2. Since 
the substitution of (— x, —y) for (x,y) preserves P2, the result follows. □ 

Note that the reversal of x + y is xy — 1, whereas the two variable reversal 
of x + y is x + y. 

The reverse of a polynomial in P2 satisfies the assumptions of the following 
lemma. 

Lemma 11.24. Suppose that f (x,y) = Y. ^\Mv l £ Pi or P2, and each ft(x) is a 
polynomial of degree i. If the leading coefficients of the ft alternate in sign then 

g(x,y) GP 2 (n) => f(x, -d v ) g(x,y) e p 2 (n) 

Proof. If we choose a € Kthenf(a,y) isinP. It follows from Lemma l9792l that 
f (a, — d y ) g(a,y) € P, and so f(x, — 9 y ) g(x,y) satisfies y -substitution. 

Suppose the leading coefficient of fi(x) is Ci.. Since the sign of Ct alter- 
nates the homogeneous part of ft(x)(— d y ) l g(x,y) has degree n, and equals 
Ci(— x) l (dy g) H . The homogeneous part of f(x,— d y ) g(x,y) has all positive 
signs (or all negative), and so the first part is proved. The second part is simi- 
lar. □ 

We need a lemma about truncating polynomials in P before we can pro- 
ceed to canceling x in P2. 

Lemma 11.25. Suppose that f e P{m—1) does not have multiple roots. Ifi n — ► xf 
where all f n e P(m) then f n (x) — f n (0) € P/or n sufficiently large. 

Proof. If ri,...,r m _i are the roots of f, then none of f(ri),. . .,f(r m _i) are 
zero since f has no multiple roots. Let a be the minimum value of 
|f (ri ) [,. . . ,|f (r m _i)|. Since f n (0) converges to there is an N such that if n > N 
we have |f n (0)| < a/2. And, if we choose N sufficiently large we can require 
that the minimum of {|f n (r) | f'(r) = 0} is at least a/2. Notice that the graph of 
fnM — fn(0) is just the graph of f n (x) shifted by less than a/2. Since all the 
critical points of f n M are more than a/2 away from the x axis, f n (x) has all 
real roots. □ 

Proposition 11.26. If xf e P 2 then f e P 2 . 

Proof. Since xf e P 2 (m) there are polynomials h n (x,y) e P 2 (tti) such that 
h n — > xf. We can write h n (x,y) = xg n (x,y) + k n (y). Since g n — > f it 
suffices to show that g n e P 2 . Now h n satisfies the homogeneity condition, 
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and therefore so does g n . We need to show that g n satisfies substitution. If we 
choose ct G R then xg n (y, oc) + k n (oc) — ► xf (y, oc). The lemma above implies 
that g n {y, oc) G P for n ^ N a . Now we can apply Lemma [l 1 . 1 1 81 to conclude 
that there is an N (the maximum of all the N a 's corresponding to roots of the 
resultant) such that n > N implies that g n [y, oc) G P for all a G R. 



We would like to prove the converse of this. 
Lemma 11.27. If f (9 X , d y )g is in P2 for all g G P 2 then f z's in P2. 
Proof. Consider the calculation 

fO x ,3Jx R u R = Y_ a ij (R) i (R) j x R - i y R -^ 

= x R_^R-n J" Qij (R).(R). x — iy— i 

0^i,Kn 

Replacing x by x/R and u by y/R yields 

R-u R-n y (|k®i x ~-i y n-j e p 

0^i,Krt 

and so we can divide out the initial factors to conclude that 

y a,fi®ix-v--ep 2 

£— v ' R 1 Ri 



Now letting R — > 00 shows that 



□ 



Y_ o tJ x n - l y n - 3 G P 2 



Taking the reversal of the last expression and applying Prop osition [TL23] fin- 
ishes the proof. □ 

11.3 Polynomials linear in y 

The polynomials of the form f (x) +y g (x) are well understood. First of all, since 
f + ag is in P for all a, it follows that f and g interlace, and their degrees differ 
by at most 1. If f and g have the same degree then we can write f = (3g + h 
where g < h, so that f + yg = h + (u + (3 ) g. Substituting y = y + |3 shows that 
we can always assume that the degrees differ by one. The next two lemmas 
show that the condition of interlacing is all we need to belong to P 2 . 
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Lemma 11.28. Iff < — geP have positive leading coefficients then the polynomial 
f +ijg lies in P2. 

Proof. If f < — g then we can write 

n 

f = (ax + b)g-^c? 9 



x + at 



where g = Yli x + a i) an d Q is non-negative. Since f + y g can be expressed as 
the determinant 



y + ax + b Ci C2 

Cl X + Qi 

C2 X + Q2 







X + Q n 



it follows that f + yg e P2. 



□ 



Lemma 11.29. I/f 

yf — g Zies m P2. 



g 6 P /«z>e positive leading coefficients then the polynomial 
Proof. The reversal of f + y g is y g — f, so yg — f S P2. 

□ 



Remark 11.30. If we choose fo < fi and go < gi then both fo +yfi and go + ygi 
are in P2. Consequently their product 

(f + yfi)(go +ygi) = fogo + (fogi + figo)y + figiy 2 

is in P2, and so the coefficients of the product interlace. This explains the 
Leibnitz property (Lemma II. 261 : 

fo9o <fo9i +fi9o <figi- 
The following lemma generalizes the Leibnitz property. 



Lemma 11.31. Suppose ft 
and define 



Then 



— gt have positive leading coefficients for 1 ^ i ^ n 
= fr 9{i,... / n.}\z 



|I|=k 



hi < — h 2 < — ■ • • 1 — h n 
Proof. The h^'s are consecutive coefficients of Oi^i + V 9i)- 
Example 11.32. In the case n = 3 the interfacings are 

fif2f3 < — fif293 + figaf3 + gif2f3 < — fig2g3 + gif2g3 + gig2f3 < — gig2g3- 

The first and last interlacings are easy but the middle one is not so obvious. 



□ 
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Here is a similar result for common interlacings. 
Lemma 11.33. If fj, . . . , f n are in P and have a common interlacing then 



Proof. If g < ft then g+yfj e P 2 and therefore TT(g-Pyfi) € P2. The coefficient 

i=i 

of y s is in P and is g n ~ s times the desired polynomial. □ 

Corollary 11.34. Suppose f = fi(x) + yf2(x) and g = gi(x) + yg2(x) are zn P2 
where i\, ii, gi, g2 have positive leading coefficients. Then f <gijf 

1. fi<gi 

2. f 2 <g2 



1 I fl f2 I < 



Proof. This follows from Lemma [l . 5 1 1 and Lemma [1.521 



□ 



There is no known criterion for a polynomial that is quadratic in y to be- 
long to P 2 . The following lemma gives a general construction for such poly- 
nomials, but it is not known if there are quadratics in y in P2 that are not given 
by this form. 

f 

Lemma 11.35. Suppose that f = (x + ci) • • ■ (x + c n ), ft = — ; , and ft, 

f 



Ci 



(X + Ci.)(x + Cj 



-. For any choice o/at, bi the following polynomial is in P2: 



y 2 f-y fe>? + b?) f i)+L|£3 

\ i / i<j 



Proof. It suffices to show that the polynomial in question is the value of the 
determinant 

y Qi ... a n 
y bi ... b n 
ai bi x + ci 







X + C T 



(11.3.1) 



The coefficient of y 2 is clearly f . To find the coefficient of y, consider the 
minor corresponding to the first row and column: 



y bi 
bi x + ci 



X + C r 
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We have seen that this determinant has value yf — Y_ bffi.. The yf term 
comes from the y 2 f term in ill. 3. It , and the remaining part is part of the co- 
efficient of y . Similarly, the value of the minor corresponding to the y in the 
second row and column of (|11.3.1t is yf — a ffi- Combining these gives the 
coefficient of y . 

The constant term of the determinant 1111.3. Il l is the value of the determi- 
nant 

Qi ... Q n 

bi ... b n 
qi bi x + ci 







Expanding using the first two rows (and some patience) gives the final 
result. 

□ 

Here is another interpretation of the last result. 

Corollary 11.36. Suppose f e P(u), f <c g, f <h, and the leading coefficients of 
f, g, h are all positive. There is a k 6 P(n — 2) so that 

k-y(g + h)+y 2 f eP 2 

Proof. Use Lemma [1 .201 to write g = Y. Q ?fi an£ i a = Y. bffi, and now apply 
Lemma [TT351 □ 

We can generalize this corollary to three variables. 

Lemma 11.37. Suppose f e P(n), f < g, f < h, and the leading coefficients ofi, g, h 
are all positive. There is a k e P(n — 2) so that 

k — yg + zh + uzfe P3 

Proof. Following the evaluation in Lemma Hi. 351 we find that 



F(x,y,z) 



z a 1 
y bj 
aj bi x + Ci 



a b n 

r \^ — I a i a i 

= yzf -zh-y g + 2_ b, b, 



where f, g, h are as in the proof of Lemma 111.351 The polynomial k 



bi bj 



fij is in P since it is just F(x, 0, 0). 



□ 
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If we take g = h. then k is zero, and so yzf — (y + z)g e P 2 . If we compute 
the determinant 



z 















V 





Ql 


a n 








w 


CLl 


a n 


ai 


o-i 


dl 


X+ Ci 





a n 


0-n 







. . x + c 



then we get y zw f — (yz — y w — zw) g which is therefore in P 4 . More generally, 
if we compute the determinant of 

diag(yi) (A,..., A)* 
(A 1 ,..., A 1 ) diag(x + d) 

where A = (ai, . . ., a n ) then we find that 

yi • • -yn 



(yi---y n )f-g Y.' 



G Pn+1- 



One way to create polynomials of a fixed degree in y is to use positive 
semi-definite matrices. 

Lemma 11.38. If Dj is an n by n positive definite matrix, and D 2 is positive semi- 
definite of rank r, f/ien |I + xDi + yD 2 | G P2, ftfls degree r in y. 

Proof. If we diagonalize D 2 , then there are exactly r non-zero terms on the 
diagonal, so the maximum degree is r. The coefficient of y r is the determinant 
of the principle submatrix of I+xDi corresponding to the zero diagonal entries 
of D 2 , and is a polynomial in P pos of degree n — r. □ 



Example 11.39. If v\, . . . , va are vectors then 



Y_ X i V i V i 



is linear in each vari- 



able since v* vt is a semi-definite rank 1 matrix. 



Example 11.40. There is a simple formula if D 2 has rank 1. If v is a vector of 
length n, A is n by n, and f = 1 1 + xD | then 



\l+w t \ = 1 +v l v 
|I + yvv 1 ] = 1 + yvv 1 
|A + yvv* = |A|(1 +yv t A _1 v) 
II + xD+yvv*! =f(l+yv t (I + xD)- 1 v) 



(11.3.2) 
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If D = diag(di) then 



II + xD+yvv* = f + y J~ v l 



f 



1 +xdi 



Since the coefficient of y interlaces the coefficient of f , we have another proof 
of part of Lemma [L20l If f < g then g — Y. a iV(l + x &i) where > 0. If we 
set w = (-/aT) then 



This is a different determinant realization of f + yg (See p. 13791 ). 

11.4 Constrained polynomials with zero coefficients 

We look at some conditions on polynomials in P2 that imply they have a sim- 
ple form. First of all, there are no non-trivial ways of adding one variable 
polynomials to get a polynomial in P2. 

Lemma 11.41. Iff[x) + g(y) £ P2 then at least one of the following holds 

1. f or gis constant. 

2. f and g have degree 1. 

Proof. Assume that neither f nor g is constant. If f is linear then it takes arbi- 
trarily large positive and negative values, so by Lernma ll.l2| q is linear. If both 
f and g have degree at least 2 then the lemma implies that they have degree 
exactly two. Moreover, if f has positive leading coefficient then g has negative 
leading coefficient, but this is impossible for then (f (x) + g(y )) H does not have 
all coefficients of the same sign. □ 

Linear combinations of the form f (x) + g(x)h(y ) are trivial if deg(h) > 1. 

Lemma 11.42. If f(x) + g(x)h(y) S P2 where deg(h) > 1 then g is a constant 
multiple ofi. 

Proof. If deg(h) > 2 the set {r e K h(y) + r e P} is bounded. Since 
My) + f(r)/g(r) e P for all r such that g(r) ^0 we have that 



is bounded. Thus f/g extends to a bounded rational function on K which is 
constant by Liouville's theorem. 

If the degree of h is 2 then we may assume that f and g have no common 
factors. If g has a root then f/g takes on arbitrarily large positive and negative 
values, so by Lemma 11.121 h. is linear. Thus g is constant, but this contradicts 
Lemma [11.411 since h has degree 2. □ 



|I + xDi +yww t | = f + yg 
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If • • • + ax 1 + x t+1 + b x t+2 H 6 P then Newton's inequality implies 

that ab ^ 0. The analogous result in two variables is much stronger. 

Lemma 11.43. I/f(x,-y) = h fi(x)u l + 0u i+1 + f i+2 (x)u i+2 + • ■ ■ e P 2 and 

ft and f i+ 2 are nof zero then thereare g,hePso f/zaf f(x,y) = g(x)h(u). 

Proof. If we differentiate, reverse and differentiate with respect to y suffi- 
ciently many times we find that there are constants a, |3 so that afi(x) + 
|3fi+2My 2 € P 2 . By Lemma HHl] we see that ft +2 is a constant multiple 
of ft. 

We can thus write 

f (x,y) =■■■ + f t _ x (x) y 1 - 1 + ft(x) (a + bu 2 )u l + • • • 

for constants a, b. Differentiating and reversing yields constants Ct so that 

Ci-i fi-i (x) u 1 - 1 + ft (x) (ct + ct+i y 2 ) y l e P 2 

Applying the lemma again shows that ft-i is a multiple of ft- Continuing, we 
see that we can write 

f(x,y) =ft(x)H(u)+f i+1 (x)u i+1 + -.- 

Reversing, we apply the same argument to finish the proof. □ 

Corollary 11.44. Iff € P 2 has all exponents of even degree, then f (x,u) = g(x)h(y) 
where g,he P. 

Proof. If we write f (x,u) = Y. fiMu 1 then if i is odd the hypotheses imply 
that ft(x) = 0. The conclusion now follows from the lemma. □ 

11.5 The polynomial closure of Pd 

If V is the vector space of a polynomials in x\, . . . , x& of total degree at most n, 
then Pd(i"i) is contained in V. The closure of Pd(n-) in V is denoted Pd(n), and 
the closure of P^ os is P^ 0b '. We define 

P d = P d (l)UP d (2)U.... 

In this section we show that most properties of Pd apply to its closure. We 
also identify some specific polynomials in the closure that are not in Pd- 
First of all, all monomials are in the closure. 

Lemma 11.45. All monomials of at most d variables are in Pd- 

Proof. The monomial xj 1 • • • x d d is the limit of these polynomials in P d : 

(xi + e(x 2 + ■ • • + x d )) 11 • • ■ (e(xi + ■ ■ • + x d -i) + x d ) id . 

□ 
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Next, we have some important containments. 
Lemma 11.46. If d < e then P d c P e . In the case & — 1 we have P C F 2 , 



Proof. It suffices to assume e = d + 1. Choose f € Pa( n )- Since f is in the 
closure of Pd(u) there is a sequence fi,f2, ■•■ of polynomials in P,j(ti) that 
converge to f. For any positive e we know that fjxi + exa+i/Xa, • • -/ x d) is 
in Pd+i- Since lim e ^o fi(xi + exd+i,X2, ■ • - ,Xd) = ft we see that f is the limit 
of polynomials in Pd+i- It is clear that if the homogeneous part of f has all 
positive coefficients, then so does the homogeneous part of 
f(xi + exd + i,X2, . . . ,Xd). This implies that Pd C P e . 

If f £ P(tx) then f(x + ey) has homogeneous part c(x + ey) n and so all 
f(x + ey) are in P2. If f £ pP os then all its coefficients are positive, and so the 
same is true of f (x + ey). □ 

We can use determinants to realize members of Pd- The proof is the same 
as Lemma [11. 5 1 

Lemma 11.47. Suppose that C is a symmetric matrix, and D is a diagonal matrix 
where the i-th diagonal entry is Y_ dijXt where dtj is non-negative. The polynomial 
det(C + D) is in P d . 

Before we go too far, let's see that there are polynomials that are not in P d . 
Consider this simple consequence of Lemma [10.591 : 

Lemma 11.48. Ifi (x, . . . , x) does not have all real roots then f ^ P d . 

This implies that xuz + 1 is not in P 3 , because ±x 3 + 1 does not have all real 
roots. We can slightly generalize the lemma to say that if we replace some of 
the variables by ±x and the resulting polynomial is not in Pd then neither is f . 
This shows that xy + z is not in P3 since ±x 2 + z is not in P2 by Example II 1.101 

There are simple polynomials in P3. Since 



it follows from Lemma [11.471 that xyz — x — z <E P3. 
Lemma 11.49. The elementary symmetric functions are in P. 

Proof. The i-th elementary symmetric function of x\, . . . ,x d is the coefficient 
of u 1 in the product 

[y +xi)(u + x 2 ) • • ■ (y + x d ) 

Since each of the factors is in Pd+i, the product is in Pd+i, and so the coeffi- 
cients of y are in Pd- □ 

As long as there is no obvious restriction, the interior of Subd is contained 
inP d . 




x 1 

1 V 1 
1 z 



= xyz — x — z 
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Lemma 11.50. Iff e int Subd and f H has all non-negative terms then f £ Pa- 
Proof. With these assumptions on f the reason that f might not be in P a is that 
f H might have some zero coefficients. If f has total degree n then we write 

f = X cixI 9= Y. *' 

I |I|=n,ci=0 

For any a > the polynomial f + ag satisfies the homogeneity condition. 
Let be an open neighborhood in Subd(n) that contains f. For sufficiently 
small a we have that f + ag is in 0, so for these a it follows that f + ag satisfies 
substitution. Since the homogeneous part of f + ag has all positive coefficients 
f + ag is in Pd(n), and converge to f. □ 

Recall the definition of interlacing: 

L g £ Pd interlace iff f + ag is in Pd for every a. 

The following are all consequences of the corresponding results for Pd- 

Theorem 11.51. Suppose f £ P^. 

1. Ifg£ Y e then fg £ Y d+e . 

2. Ifi<g,hand g,h£ Pd then f <; g + hand g + h£ Pd. 

3. f and interlace. 

4. Linear transformations extend from P — ► P to Pd — ► Pd- 

5. Polynomial limits of polynomials in Pd are in Pd- 

6. Iff = Y_ ai(x)y ffen aj(x) e Pd 

We can generalize Lemma Hl.28l to Pd, but we no longer have a nice deter- 
minant representation. 

Lemma 11.52. If f < g e Pd then f + x d +ig £ Pd+i- 

Proof. To simplify notation we assume that d = 3. Define a polynomial in four 
variables 

F e = f (x + eu,y + eu, z + eu) + ug(x + eu,y + eu,z+ eu). 

Since lim e ^ F e = f + ug it suffices to verify that F e £ P 4 . The homogeneous 
part satisfies 

pH _ fHj x _|_ eu y _|_ eU/Z + eu) +ug H (x+ eu,u + eu,z+ eu). 

Since both f H and g H have all positive terms, it follows that F^ 1 also does. It 
remains to check substitution. Setting y — a, z — (3,u = y we need to show 
that 

F e (x, a, (3,y) = f(x + ey, a+ ey, |3 + ey) +yg(x + ey, a+ ey, (3 + ey) 
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is in P. Since f < g in P3 it follows that 

f(w, oc+ ey, (3 + ey) < g(w, a + ey, (3 + ey) 

Since any linear combination of these two polynomials is in P we add y times 
the second one to the first and replace w by x + ey to finish the proof. □ 

If we don't introduce a new variable then the result is easier. 

Corollary 11.53. Iff < g in P d then xig + f e V A for 1 < i < d. 

Proof. Use Lemma llO.1161 □ 

The polynomial f +y g+zh+yzk is in P3 if f, g, h, k satisfy certain interlacing 
assumptions. Here's a more general result. 

Lemma 11.54. Ify— (t)i,. . .,y&) and F = Xi^MOiCy) satisfies 

1. x-substitution 

2. deg (f o...oM) ^ deg (f r (x)) + deg (g r (y)) for all I 

3. all f z and g r /zaz;e all non-negative coefficients 
then F e P d+ i 

Proof. The notation fo...o refers to the terms that contain no yt's. Let G € = 
F(x + ey,y) where y = y\ + ■ ■ ■ + y d and e > 0. If n = deg(fo...o(x)) then 
condition two implies that n ^ deg (fi(x + eyjg^y)) so the degree of G e is 
n. The homogeneous part of G e includes a term (x + ey) n , and the third 
condition guarantees that all other contributions are non-negative, so G e has 
all positive terms. If we let a = (a\, . . . , a d ) then G e (x, a) = F(x + ea, a) where 
a = ai + • ■ ■ + a d . Now F(x, a) has all real roots by the first condition, and so 
the translate by ea is also in P, and so G e (x, a) satisfies x-substitution. Since 
lim e ^o G e = F, it follows that F e P d +i. □ 

Corollary 11.55. Suppose that 

T(x,y,z) = f 00 (x) +yfio(x) + zf i(x) +-yzf u (x) 

satisfies 

1. F(x, a, b) G Pfor all a,bel 

2. foo € P pos - 

3. deg(foo) ^ deg(fio) + 1 

4. deg (f 00 ) > deg (f 01) +1 

5. deg(foo) > deg(fn) +2. 

6. All f ij have non-negative coefficients 
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then F £ P 3 . 

Remark 11.56. We can multiply the Taylor series and extract coefficients to 
derive some properties of derivatives (Theorem l9.39l Corollary l9.93|l . Starting 
with f € P2 and writing 

f(x + u,y +v)= f(x,y] + f x (x,y)u + f y (x,u)v + 

2 2 

it u 

fxx(x,u)y + fxy(x,u)uv + f yy (x,u)y + ... 

we first multiply by u + a. The coefficient of u is <xf x + f . Since this is in P2 for 
all a we conclude f < f x . 

Next, choose oc > and multiply by u 2 — 2 a € P4. The coefficient of u 2 is 
af** — f, so we conclude that f — <xf xx 6 P2 for all positive oc. If instead we 
multiply by 1 — ocuv 6 P2 we find that the coefficient f — <xf X y of uv is in P2. 

Since (u + 1) (v + 1) — 1 = uv + u + v is in P2, the coefficient f x + f v + f xy 
of uv is in P2. 

In general, if g(u, v) = Y. a TS u r v s £ P2 then 

f(x + u,y+v)=^DiD5 J fy^ 

v— • • u i+T v j + s 
f(x + u,y+v)g(u,v) =^a rs DiD^f— — 

The coefficient of v n u m is 

Z_ tiji u ^ u y' 

and is in P2. 

Lemma 11.57. Suppose that f (xj, . . . , Xd ) — Y. a i x ' zs fl non-zero polynomial, and 
define 

g(x) = f (eio + Y_ e i) x i' ■ • ■' £ do + X. ed ' x ') 

If I is an index for which a/ = then the coefficient of x 1 in g (x) is a non-zero 
polynomial in the etj . 

Proof. A monomial ajX J in f contributes a product 

aj EI ( £k0 + Y- £kiXi ) (11.5.1) 

to g(x). Every term in this expansion has the form ■ ■ ■ e ] d d ? . where each "?" 
represents an index in 0, . . . , d. Consequently the coefficient of x 1 in g(x) is a 
sum of terms of the form 

(coefficient of x J ) x (at most one term arising from x J ) 

and so there is no cancellation. Note that ( |11.5.H has terms of every index of 
degree at most deg(f ). □ 
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Lemma 11.58. If f (x) £ Pd has non-negative coefficients then it is a limit of polyno- 
mials in P^ os . 

Proof. Define 

f e (x) = f (eio + e^Xj,. . ., e d0 + e dj Xj). 

By Lemma fl 1 .571 the terms in f e with coefficient zero in f are a sum of polyno- 
mials in the etj with positive coefficients in g(x). Observe that f is the limit of 
f e through positive values of £a — > 1 and e^j — > 0. All the f e are in P d , have 
all positive coefficients for all indexes of degree at most deg(g), and so f is a 
limit of polynomials in P^° s . □ 

Interfacings in P d can be approximated by interfacings in P d . 
Corollary 11.59. Iff < — g in Pa then there are f n , g n such that 

• lim f tl = f 

• lim g n = g 

• fn < 9n 

Proof. Since f + yg € Pa+i we let h e be the polynomial determined by the 
lemma above that converges to f +ug. If h e = f e (x) + yg e (x) + • • • then f e , g e 
are the desired polynomials. □ 



11.6 Multiaffine polynomials 

A multiaffine polynomial is a polynomial that is linear in each variable. We look 
at a few properties of such polynomials; for a more detailed discussion see 
[bbs]. 

Definition 11.60. 

MA d = {f 6 P d | all variables have degree 1} 
We can write a multiaffine polynomial as 

Y aix r 

IC{l,..,d} 

A multiaffine polynomial in MA d can be visualized as an assignment of real 
numbers to the 2 d vertices of the d-cube. 

There is no simple way of determining if a polynomial f (x) is multiaffine. 
Branden) 23 ] proved that a necessary and sufficient condition is that 

3f 3f , 3 2 f „ r „ , 
— — ~ f - - > for all 1 ^ n. 

OXi OXj OXiOXj 

In the case d = 2 this is the simple criterion 

a + bx + cy + dxy g MA 2 <=> ad - be < 



CHAPTER 1 1 . THE POLYNOMIAL CLOSURE OFP d 



390 



Example 11.61. Here is an application for a polynomial of total degree 3. Con- 
sider the question: 

For which m is f (x,y, z) — xyz — x — y — z + rn.6 MA3? 

Using the criterion we find 

9f 3f , 9 2 f , 

^r^ f • ^r- = z 2 - zm + 1 

ox oy oxoy 

This is non-negative if and only if |m| ^ 2. A similar calculation shows 

xyz — ax — by — cz + m € MA3 iff a, b, c and m 2 ^ 4abc 

Here are three examples of general multiaffine polynomials in MA^; the 
first two are trivial to show multiaffine, but the last is the Grace polynomial 
Graced and does not have an elementary proof that it is in MA^. 



1 

d 



1 

d 

cr^sym(d) 1 

It's easy to construct multiaffine polynomials using this lemma: 

Lemma 11.62. Iff e Pd then the multiaffine part off is in MA^- 

Proof. The usual reversal, differentiation, reversal shows that if Y_ ft My 1 £ 
Pd then fo(x) + fi(x)y S Pd- Repeating this for each variable shows that the 
multiaffine part of f is in Pd- □ 

Example 11.63. If we let f = (1 + x + y + z) n then the multiaffine part is 

1 +n(x + y + z) +n(n— l)(xy +xz + yz) +n(n — l)(n — 2)xyz 
and is in MA3. 

This last example is an example of a multiaffine polynomial that is sym- 
metric in its variables. See Lemma [10. 531 

Multiaffine polynomials are closed under multiplication in the following 
sense 

Corollary 11.64. If f, g e /W/Ad the the multiaffine part off g is also in MA^. 
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We can also construct multiaffine polynomials using determinants. If S is 
symmetric and Di, . . . , D d are positive definite then the multiaffine part of 

|S + xiDi + ---+x d D d | 

is multiaffine. We don't always have to take the multiaffine part. If v\, ■ ■ ■ , v d 
are vectors then 

|S +xiv*vi + hx d v d v d | 

is multiaffine. 

There are no interesting multiaffine bilinear forms. 

Lemma 11.65. J/f = YJ\ aijXiUj <= MA 2 & then we can write 

n n 

f = (Y_ oCiXi) ■ (Y_ PiUt) 
1 1 

where all at have the same sign, and all |3i have the same sign. 
Proof. Write (cuj) = (vi, . . .,v d ) 1 so that 

/vA /tji 
f = (x x ,...,x d ) : : 
\ v d/ \yd; 

We see aiS^r = °' and hence > °- If y = ^ y ^ then ^ = v i • y 

and ^i- = Vj • y. Consequently, 

(vi-y)(vj -y) >0 

If the product of two linear forms is never negative then they must be multi- 
ples of one another. Thus we can write Vj = a,vi for some constant aj which 
leads to the the desired form 

f = [aixi H h a d x d ] • [viy] 

Each of the factors is in MA d so all the coefficients of a factor have the same 
sign. □ 

The following might be true for arbitrary multiaffine polynomials with 
positive coefficients. 

Lemma 11.66. Polynomials with positive coefficients in MA 2 are closed under 
Hadamard product. 

Proof. We know a+bx+cy + dxu is in MA2 iff | \ " | ^ 0. If a, b, c, d are positive 
then I ^ £ I is positive semi-definite. Since positive semi-definite matrices are 
closed under Hadamard product, the conclusion follows. □ 
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11.7 Multiaffine polynomials with complex coefficients 

Multiaffine polynomials with complex coefficients are much more compli- 
cated than those with real coefficients. A multiaffine polynomial with real 
coefficents determines polynomials with complex coefficients, but not all of 
them. We give this simple construction, and then restrict ourselves to multi- 
affine polynomials with two variables, where we will see an equivalence with 
certain Mobius transformations. 

Suppose that f (xi, . . . , x d ) <E MA d . If cr 1; . . . , cr d lie in the upper half plane 
then f(xi + cr 1; ...,x d + cr d ) is multiaffine, has complex coefficients, and is 
non-vanishing for x e UHP. 

So, assume that f (x,y ) = ax + (3 + yyx + 6y. Solve f (x,y ) = for y: 

ax + 13 

yx + 6 

If M is the Mobius transformation with matrix s ) then y = — M(x). If 
x€ UHP and f e H P 2 then y e C — U H P, for otherwise we have a solution with 
both x,y in the upper half plane. Thus, x E UHP if and only if M(x) e UHP. 
Consequently, 



f is complex multiaffine ^> M : UHP — > UHP. 

For the rest of this section we determine conditions on a Mobius transfor- 
mation to map UHP — ► UHP. 

If M maps the upper half plane to itself then the image of the upper half 
plane is either a half plane parallel to the real line, or the interior of a circle 
lying in the upper half plane. 

Suppose that ( " 6 ) * s a ma trix with complex entries that maps the upper 
half plane to itself. Since our entries are complex, we can divide by the square 
root of the determinant, and so without loss of generality we may assume that 
the determinant is one. In the case the image of the upper half plane is a half 
plane contained in the upper half plane we have 

where ( g £ ) is a matrix with real entries and determinant 1, and t ^ 0. 

The more interesting case is when the image of the real line is a circle con- 
strained in the upper half plane. The image of the upper half plane is either 
the interior or the exterior of the circle. We first find the center of the circle 1 

By — a6 

center = — =■ 

by — yb 



thanks to David Wright for this information. 
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The image of oo is a/y, so the radius is the distance from the center to a/y: 

1 



radius 



|23(yS)| 



The image of the circle is entirely in the upper half plane if the center lies 
in the upper half plane, and the radius is less than the imaginary part of the 
center. This gives the middle condition below. If this holds, then the image 
of the upper half plane lies in the circle if and only if is not the image of a 
point in the upper half plane. Since — (3/a maps to zero, we have the three 
conditions 



J (center) > 



a (3 
y 5 



> 1 



3(|3/a) ^ 



It is more interesting to write these conditions in terms of the real and 
imaginary parts. Write 



a - 

y ■■ 



di + xa.2 

Ci + IC 2 



p 

6 



bi + \b 2 
di + id 2 



Note that if we want to show 3[<j/t) > it is enough to show 3(crr) > 0. 
The three conditions are 



2 (biCi + b 2 c 2 — aidi — a 2 d 2 ) (— Cid 2 + c 2 di) > (center) 
— biCi — b 2 c 2 + aidi + a 2 d 2 ^ 1 (radius) 
aib 2 — a 2 bj > (interior) 

Now the determinant of M is one, which gives us two equations 



-b 2 Ci — biC 2 + a 2 di + ajd 2 
-biCi + b 2 c 2 + aidi — a 2 d 2 



(imaginary part) 

1 (real part) 

If we reexpress these in terms of determinants we get 



CLi bi 

ci di 



+ 



a 2 


b 2 




Cl 


C2 


C2 


d 2 


) 


di 


d 2 


ai 


bi 




a 2 


b 2 


Cl 


di 


+ 


C2 


d 2 


ai 


bi 




a 2 


b 2 


Cl 


di 




Cl 


d 2 



>0 



> 1 



(center) 
(radius) 
(real part) 



and these simplify to four conditions 



ai bi 
ci di 



> 1 



q 2 b 2 
c 2 d 2 



^ 



ai a 2 
bi b 2 



>0 



Cl c 2 
di d 2 



>0 
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Expressed in terms of the original coefficients this is 

|91(M)|>1 |3(M)|<0 3(a/P)<0 3(y/6) < 

Example 11.67. The Mobius transformation with matrix M = ( ^ ^ ) maps 
the real line to the circle with center 1 + 1/2 and radius 1/2. Since the solution 
to Mx = is + 1) = —1/2(1 + i), the upper half plane maps to the interior 
of the circle. The corresponding complex multiaffine polynomial is 

i + (1 + i)x + (1 + i)y + (2 + x)xy 

The four conditions are 

|9t(M)| = l |3(M)| = Im(a/P) = -1 3(y/6) = -1/2. 

11.8 Nearly multiaffine polynomials 

A nearly multiaffine polynomial is a polynomial that is degree one in all but one 
variable. 

Lemma 11.68. If yf (x) + zg(x) e P3 fen f is a constant multiple of g. 

Proof. We first remove all common factors of f and g so that it suffices to show 
that f and g are constant. Since yf + zg e P 3 it follows that f and g interlace. 
We consider the possibilities: 

First assume f < g; the case g < f is the same. If we let y = x then (xf ) + 
zg e Pi- But this is impossible, since this implies that g and xf interlace yet 
deg(xf) =2 + deg(g). 

We next assume f«g. All roots of g are smaller than the largest root s of 
f. If r > s then (x — r)f + zg e P2, and hence (x — r)f < g. But this implies that 
g has a root in [s, r], which is a contradiction. 

Thus, f and g must be constant, which proves the lemma. □ 

Corollary 11.69. Jff(x,y) = Y.tv^ii x ) e ^d+i then there are positive ol\ and 
geP such that 

f(x,y) = g(x) (aiyi H a d y d ) 

We next show that there are no interesting matrices M(x) such that ylVlz* € 
Pd for vectors y,z of variables. The only such M have the form g(x)v t ■ w 
where v, w are vectors of positive constants. 

Lemma 11.70. Zff(x,y,z) = Y.\ Vi z jfij ( x ) € P2d+i then there are g e P and 
positive on, (3t so that 



f(x,y,z) = g[x)(Y_ otiVi)(Y_ l 3 ^)- 
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Proof. Consider the matrix M(x) = (ftj(x)). For any a e E we know from 
Lemma [11.651 that M(a) has rank 1, so for all distinct i, j, k, I 

fij(a)fici(a) =fu(a)ficj(a) 



and hence fyfici = fuficj as polynomials. Now 
f'ii • M - (fnfij) =| : ] (f 



li 



fdi. 



vflc 



since fnfij = fijf ji- It follows that 

fn • f (x, y, z) = y if u ) Zif u) . 
Since each factor is in P2d+i we can use the corollary to write 

fn • f(x,y,z) = g(x)(22 KiVi)(Y_ $ iZi )- 
Clearly f n divides g, giving the result. 



□ 



We can apply the characterization of complex multiaffine polynomials to 
get properties of nearly multiaffine polynomials. 

Lemma 11.71. Suppose f (x) + g(x)y + h(x)z + k(x)yz e P3. If cr e UHP, b 2 = 



f(a) g(<j) 
h(cr) k(«r) 



and 



a [3 
Y 6 



f(a)/5 g(ff)/6 
h(a)/6 k(a)/6 



91(a) 3t(P) 
9t(y) 9t(8) 



5* 1 



3(a) a(|3) 
3(T) 5(5) 



<0 



11.9 The polar derivative and the Schur-Szego map 



The polar derivative was defined for polynomials in P^° s (See p. 13211 ) but its 
properties still hold for as well if we restrict ourselves to a single variable. 
If f(x,y) 6 Pd+i then the polar derivative with respect to y, which we shall 
denote as dy° lar , is defined as the composition 



reverse y differentiate reverse y 



gpoiar (jgfj^gg a ma p f rom p d+1 to itself since reversal maps Pa+i to itself. As 
in one variable we have the formula 

3P° lQr f = nf-ydy f 

where n is the degree of y in f . The key fact about polar derivatives is 
Lemma 11.72. Jff(x,y) e P d+1 then 
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1. d y and 9{] olar commute 

2. d y i and 9{j olar f interlace. 

Proof. Commutativity is the calculation 

dydl olar i =nf'-f'-yf" = (u-l)f'-yf" = 9£ olar 9 y 

Choose a e M and let g(x,y) = f (x,y — a). Since g e P d +i we know that its 
polar derivative is in Pd+i and therefore ng — y 3 y g e P d +i. Substituting y + a 
for y yields 

nf (x,y) - (y + a)f(x,y) = 9£ olar f - a9 y f e P d+ i 
which proves the lemma. □ 

We can apply this to derive an important theorem. We call the transforma- 
tion the Schur-Szego map. 

Theorem 11.73. //f = YJa cuy* e P(n) and £ QiWy 1 G P~d+i(n) fen 

^aii! (n-i)!gi(x) e P d 

Equivalently, the transformation y 1 x f (x)y' ^ i ^ . \ ^ ! 

I x! ( xx xj ! f ^XJ X — ^ 

defines a mapP(n) x P d+ i(u) — > P d (n). 

Proof. By the lemma we see that if f (x) e P and F(x,y ) is the homogenization 
of f then 

F(a v ,as olar ):P d+1 ^p d+1 

since 9 y + a9 y olar : Pd+i — > Pd+i and the regular and polar derivatives 
commute. It's easy to see that 

gi/gpolarxn-i , _ fi!(n-i)! i = j 

so we have 

F(9 y ,9r lQT ) Z 9iWy l = L ai9i(9^ lar ) n_ Vgi(x) 

= ^aii!(n-i)!gi(x) G P d (11.9.1) 

□ 

There are two important consequences that previously were proved using 
Grace's theorem. 
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Corollary 11.74 (Schur-Szego). I/X c^x 1 ePandY btx 1 e P V ° B then 

Y_ dibii! (u-i)!x l e P. 

Proof. Since Y bi^ 1 G P pos we know that Y bi (-xy) 1 G P2- Now apply the 
theorem. □ 

Corollary 11.75 (Borceau, Branden, Shapiro). IfY a i xl e P anc ^ 9M G P f/zen 
Proof. Apply the theorem to 

g(x+«)=^g (ll (x)|ep 2 . 

□ 

More generally, if T: P — > P2 is a map (See p. 15121 ) then we get a map 

PXP 3>P 




P X P 2 

11.10 Determining the direction of interlacing 

If all linear combinations of f and g are in Pd then either f < — g or g < — f . 
Determining the direction of interlacing is easy if the degrees are different; 
here are some ways of determining the direction when the degrees are the 
same. First of all, 

Lemma 11.76. Iff ~ g in Pa have the same degree then either f<qorq<f. 

Proof. The homogeneous part of af + g is positive for large a, and negative 
for very negative a. Thus, there is an a for which one of the coefficients of 
(af + g) H is zero, but this is only possible if (af + g) H =0. For this a define 
r = af + g. Now f ~ r, and since r has smaller degree than f we have f < r. 
Thus g = af + r, from which the conclusion follows. □ 

Lemma 11.77. If f, g e f <c g, and g <C f then g is a scalar multiple ofi. 

Proof, f <C g implies that there are a > and r such that f < r, f = ag + r and r 
has positive leading coefficient. But g^C f implies f = (l/a)g — (l/a)r where 
— r has positive leading coefficient. It follows that r = 0. □ 
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Lemma 11.78. Suppose that f ~ g in P d . If either 

V a 2 ,...,a d e R f(x, a 2 ,...,a d ) < — g(x, a 2 , . . . , a d ) 

or 

3a 2 ,...,a d 6 R f(x, a 2 ,...,a d )<g(x,a 2/ ...,a d ) 

t/ieM f < — g. 

Proof. We may assume that both f and g have degree n. If q <gc f then we know 
that 

g(x, a 2/ . . . , a d ) < f (x, a 2 , . . ., a d ] 
The hypothesis gives the other direction, so 

g(x, a 2 ,...,a d ) = af (x, a 2/ . . ., a d ) 

Since this holds for all x and at € R this implies that g = af, so f < — g still 
holds. 

If we have strict interlacing for one choice of cu then we can't also have 
g(x, a 2 , ...,a d )<f(x, a 2 ,...,a d ). □ 

We only need interlacing in one variable to determine interlacing for P 2 . 

Lemma 11.79. Iff, g£P 2 and f(x, a) < g(x, a) for all a e R then f < g. 

Proof. It suffices to show that f H and g H are scalar multiples. Geometrically 
the hypotheses say that the solution curves for g interlace the solution curves 
for f . For large positive x we therefore have f H (x) ^ g H (x) and for small neg- 
ative x we have f H (— x) ^g H (— x) which implies f H is a scalar multiple of 

g H □ 

Substitution for P d preserves the direction of interlacing but this is not true 
for P d . For example 



x 1 1 

1 V 1 
1 1 z 



2-x — y +z{xy - 1) 



so 2 — x — y<Cxy — 1. However, substitution of y — 2 and y — —2 gives 
interlacings in different directions: 

-x = (2-x-y)(x,2) > (xy-l)(x,2) =2x-l 
-x + 4 = (2-x-y)(x,-2) < (xy-l)(x,-2) =-2x-l 

In the following we let a = (ai,...,a d ) G R d andb = (b l7 ...,b d ) e (R+) d . 
From Lemma ri0.59l we know that if f (x) s P d then f (a + bt) s P. The converse 
is also true, see [18]. 
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Lemma 11.80. Iff ~ g in P d and f (T) 

where T = a + bt t/ien f < — g. 



g(T) for all a G M d and b G 



Proof. We first assume f, g G P d (n). If we take a = (0, Q2, . . ., Qd) and b £ = 
(1, e, . .., e) then f (a + tb e ] < g(a + tb e ). Since f (a + tb e ) and g(a + tb e ) also 
have degree n it follows that f(a + tb)<Cg(a + tb). Thus 



VQi G 



f (t, a 2 , . . . , a d ) < g(t, a 2 , . . . , a d ) 



sof<g. 

Now assume that f, g G P d have the same degree. Let X = xi H + x d , 

f e (x) = f(x + eX), and g e (x) = g(x + eX). We claim 



1. fe,9e G P d 

2. lim e ^ fe = f, lim e ^ 9c 

3. f e (T) g e (T). 



9- 



Given 1,2,3 then by the first part we have that f e (x) < — g e (x). All terms have 
the same degree, so using Lemma [l.42l f fx) <C q(x). 

Now 1) follows from Lemma [l0.59l 2) is clear, and 3) follows from the first 
part as follows. If 



S = a + tb + e 



^di + t^bi 



(1,1,. ..,1) 



□ 



then f e (T)=ffS) and 8e (T) = g(S) 

Interlacing in P d is the limit of interlacing in P d . 
Corollary 11.81. Iff < — g in P d then there are f n , g n such that 

2- fu,9n e Pd- 

2. f n < — g n - 

3. lim f n = f and lim g n = g. 



Proof. First assume f < g. Since f + x d+ i G P d +i there are F n G P d +i such that 
limF n = f + x d+ ig. If we write F n = f n + x d+ ig n + • • • then f n , g n satisfy the 
three conclusions. 

If f < — g then we can write g = <xf + r where f < r. Now apply the first 
paragraph to f and r. □ 



Corollary 11.82. Iff <g in P d then 



f g 

af sg 



< 0/or ; 



l,...,d. 



CHAPTER 1 1 . THE POLYNOMIAL CLOSURE OFP d 



400 



Proof. We approximate and reduce the problem to the corresponding result 
for P d . If suffices to assume that f < g and t = 1. Since f + Xd+ig € P d +i there 

are F n = f n + x d+ ig n H e P d+ i such that lim^oo F n = f + x d+ ig. Now by 

Taylor's theorem 

F n (xi +t / x 2/ ... / x d+ i) =f n + x d+ ig + t l-x d+ i- — 

\ OXi OXi 



so Corollary H0.27l tells us that 



fn gn 



< o 



and taking limits finishes the proof. 



□ 



f g 

9xi 9xi 



Corollary 11.83. 1/ f ~ g e P d and 

ro/zere 1 < i ^ d then f -c g. 

Proof. We know f <C g or g < f . If the latter holds then 



< for some xel d and some i 



f g 

df dg 
9xi 9xi 



^o 



but this contradicts the hypothesis. 



□ 



11.11 Reversal in Pa 

We use the properties of the graph of a polynomial in P d to prove that the 
reversal is in P d . 

Lemma 11.84. Iff (x) e P d (n) then the reversal f Tev is in P d (nd), where 

f rev (x) = (Xj • • • x d ) n f (-,...,—) (11.11.1) 

Proof. Define 

9e{x)=\T]{x i + eyx i )) ff 1 L. j 

Since lim e ^oge(x) = f rev (x), it suffices to show that g e € P d (nd). Now 
g e clearly satisfies homogeneity so we just have to check substitution. Let 
xi = t, X2 = Q2, . . - , x d = a d/ and set a = Q2 + ■ • • a d . It suffices to show that 
the second factor, which is 

ff 1 1 1 



t+ea' et+ea+(l — e)a2' "' et+ea+(l — e)a d / ' 
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has exactly nd roots. So, consider the curve 
1 1 



s t+ea'et + ea + (l — e)Q2'"'et + ea + (l — eja^ 

The curve 6 has singularities at 

1 - e 1 - e 

-ea,-a ci2, . . .,— a ad 

e e 

As t goes to ±oo the curve goes to 0, so there are d components in the closure 
of the image of 6. A component has two coordinates that are unbounded; 
the other coordinates only vary by a bounded amount. The curve goes to 
+oo in one coordinate, and is bounded in the others. In the other direction, 
it goes to — oo in the other coordinate, and is bounded in the remaining ones. 
Consequently, the curve is eventually above the graph in one direction, and 
below in the other, so each component meets the graph in n points. Since there 
are d components of 6 we have nd intersection points, and so substitution is 
satisfied. □ 

Using reversal, an argument entirely similar to Lemma [11-271 shows that 

Lemma 11.85. Ifi (3 x )g is in F^for all g 6 Pd then f is in Pa- 

Lemma 11.86. Suppose that f(x,y) = Y. hMv x € Pa+i or Pd+i, and the degree 
o/fi(x) is i. 

If the homogeneous parts of the ft alternate in sign then 

g(x,y,z) e Pd+e+i(n) f(x,-d y ) g(x,y,z) e P d+e+1 (n). 

Proof. If e > so that there are some z variables, then we replace f(x,y) by 
f(x + ez,y), which still satisfies the degree hypothesis. We now follow the 
proof of Lemma ITT241 □ 



11.12 Induced transformations 

A linear transformation T on polynomials in one variable determines an in- 
duced transformation defined on polynomials in two variables: 

T,(xV)=T(x l )y J . 

Conversely, we will see that polynomials in P2 can determine linear trans- 
formations on P. We have seen examples of induced transformations (See 
p. 13171 ) in Pd where the results are easier. 

We first make the important observation that if T: P — ► P then T does 
not necessarily determine a linear transformation T* : P2 — ► P2. As in Re- 
mark |9]45l we take T(g) = g(D)x and note that (but see Lemma [ll.911 



T(xy-l) = y-x^P 2 . 
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However, if T: P — ► P then the induced transformation always satisfies 
x-substitution, as the calculation below shows: 

(T,f)(x,a) = T(f(x,a)) e Psincef(x,a) e P. 

Definition 11.87. If T is a linear transformation on polynomials in one vari- 
able then we say that T satisfies induction if the induced transformation T* de- 
termines a map : P2 — ► P2. If T* : — ► P2 then we say that T satisfies 
induction on P^ 08 . 

Note that if T satisfies induction then T maps P to itself, since f G P im- 
plies T(f) = T*(f) 6 P2. Also, if T and S satisfy induction, then so does their 
composition ST. Here are two assumptions that guarantee that induction is 
satisfied. 

Lemma 11.88. Suppose that T: P — > P. If either of the these two conditions hold 
then T satisfies induction. 

1. There is an integer r such that ifl(x x ) ^ then the degree o/T(x 1 ) is i + r, 
and T(x x ) has positive leading coefficients. 

2. T(g) = -f*gwherei eP pos . 

Proof. The first part is a slightly more general statement that Theorem 19.441 
the proof is similar. In this case we have T* : P2 — ► P2. 

In the second case, suppose that h.(x,y) € P2(tx), and f € P pos (r). If r is at 
least n then the homogeneous part of T(H) has all positive terms. However, 
if r < n then T(h) H has some zero coefficients. Replace f by f e = f(x)(l + 
ex) n where e is positive. The corresponding transformation S(g) = f e (x) * g 
satisfies S*(h) S P2 since f e € P pos (n + r). Taking limits as e goes to zero 
shows that T* (h) € P2. 

□ 

We can resolve the problem with the transformation g 1— * g(D)f by intro- 
ducing a negative factor 

Corollary 11.89. Ifi[x) is in P then the linear transformation T: g h g(— D)f 
satisfies induction. 

Proof We calculate 

Since f (x + y) e P2, we can apply Lemma [11 .9 II □ 
Lemma 11.90. IfV[x,y) = Y. hMi-y) 1 is in P 2 then 
1. The linear transformation T: x x 1— > fi maps P — > P. 
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2. T satisfies induction. 
Proof. By Lemma [ll.21l the reverse of F satisfies 

Choose h(x,y) — ^~ hj(x)y x e P2 

T(hi(x)) = coefficient of y n in ru(y)F rev (x,y) 
T* (h(x, y ) ) = coefficient of z n in h(z, y )F Tev (x, z) 

Since K(z,y) and F Tev (x, z) e P3, it follows that T*(h) e P2. Finally, (2) holds 
since T satisfies induction. □ 

As is often the case, if we replace multiplication by a differential operator 
and add a constraint, we can introduce a factorial. 

Lemma 11.91. Suppose that f(x,y) = Y. fi {; X) (— y) 1 is in P2, and each ft is a 
polynomial of degree i with positive leading coefficient. 

1. The linear transformation T: x 1 1— > ft maps P — ► P. 

2. T satisfies induction. 

Proof. Choose g(x,y) = Y.gi(y)x v in P2. By Lemma 111.861 we know that 
f (x, — 9 z )g(y, z) e P3. The coefficient of z° is in P2 and equals 

□ 

We can show a linear transformation preserves all real roots by applying it 
to test functions. 

Lemma 11.92. If T z's a linear transformation on polynomials and T* (1 — xy ) n e P2 
for n = 1,2,. . . f/zen T : P — ► P. 

Proof. Let g =Y.o QiXl e ^ anc ^ calculate 

.<-.T.(>-?)*L-^'-«« T .f(^)'(;)L 

= 2_at— iT(x l ) 

i=0 

Since g(— D y ) maps P2 to itself, and evaluation yields a polynomial in P, 
we see the above polynomial is in P. Finally, taking the limit of polynomials 
yields 

lim j~_ Qi^iT(x l ) = Y_ QtKx 1 ) = T(g(x)) 

i=0 

□ 
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11.13 Linear Transformations 

One of the ways that we use to show that a polynomial has all real roots is to 
identify it as a coefficient of a polynomial in P 2 . We use this idea to show that 
some linear transformations preserve P. 

Theorem 11.93. Suppose f is a polynomial in P 2 of degree n with coefficients given 
in (|9.3.21 . The linear transformation defined byT: x 1 1— * f n _t maps P(n) to itself. 
Similarly the transformation x 1 i— > ft maps P to itself. 

Proof. Choose a polynomial g S P. Since g(y) G P2 the result of multiplying 
g(y)f(x,y) is in P2, and hence the coefficient of y n is in P. The coefficient of 
y n is exactly Tg since 

g(y)f(x,y) = o,u J ) (^fj(x)y') 

k \ i+j=k / 

If we replace g by its reverse then the transformation is x 1 1— > ft. □ 
Corollary 11.94. Iff e P t/zen i7ze map T : x 1 1— * 1— preserves roots and interlacing. 
Proof. Apply the Theorem to the Taylor series 

f(x + y)=f(x) + f'(x)y + --- 

□ 

An alternative argument follows from the observation that T(g) = 
g(D)EXP(f). Or, we could apply EXP to Theorem [1X93] 

Lemma 11.95. If¥(x,y) is the homogenization of i then define a map T: f x g 1— > 
F(— ^r,y) g- This defines a linear transformation 

P v ° s x P 2 — ► Pi- 
Proof Since f e j>v° s we can write F = FT(ociX + y ) where all at are positive. 
From Lemma [l . 651 we know that 

i 

maps P2 to itself. The result follows by letting e go to zero. □ 

A similar application of Lemma [l . 651 shows 
Lemma 11.96. Iff e P and g e P 2 then f (y - D x )g(x,y) G P 2 . 



CHAPTER 1 1 . THE POLYNOMIAL CLOSURE OFP d 



405 



Proof. If suffices to show that (y + ex — D x )h(x,y) G P 2 if h G P2 and e is 
positive. This follows from Lemma ll0.65l □ 

There is an interesting identity related to this last result: 

f(y + D x )=^f< l '(D x )g^ 

i 

Since this is linear in f only need to verify it for f = x k 

X i 

= (y + D x ) k g 

The next result constructs a transformation by looking at coefficients. 

Theorem 11.97. Choose f(x,y) G P 2 d, write f in terms of coefficients f — 
Y_i Q-iWy 1 , and let K be an index set. The linear transformation defined by 

maps Pd to itself. 

Proof. If g(x) G P d then g(x) G P 2d , and hence g(y)f(x,y) G P 2d . If g(y) = 
bjy J then we can write 

g(y)f(x,y)=^b, aitxtf+l 
ij 

The coefficient of y K in this product is in Pd, and equals 

Y_ bjdi(x) 

I+J=K 

This is exactly T(g) since 

T(g(x)) = T fZ b J x, j =Z b J a K-jM 



□ 



Corollary 11.98. Iff G Pd and K is an index set then the map 



I c) 1 

II ox 1 



defines a map Pd 1— > P d . 
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Proof. Since f(x) £ Pa we know that f(x+y) £ P2d- The Taylor series of f(x+y) 



is 



f (*+y) = Lir|? f(x) - (1Lm) 
i 

The corollary now follows from the proof of Theorem lll.971 □ 
11.14 The diamond product 

Recall (§ 1616.111 the diamond product (*) (*) = ( T * J . We show that f 0g has 

all real roots for certain sets of f and g by identifying it as a diagonal of a 
polynomial in P3. In order to do this we need an identity [178[: 

f<>g = £^+l)V DW 
* — 

i 

It turns out that the signed version of this identity also gives a bilinear 
product that often preserves roots, so we will begin with it. 

Lemma 11.99. Define a bilinear map 

S(f,9)=^(-l) i (X ^, )y P i fZ7g 

i 

If (f, g) has roots in any one of the four quarter planes below 2 (see Figure \TL2l then 
S(f,g)eP. 

(1) (-oo,0) x (-oo,-l) 

(2) (-oo,-l) x (-oo,0) 

(3) (0,oo) x (-l,oo) 

(4) (-l,oo) x (0,oo) 

Proof. We begin with two consequences of Taylor's theorem 

, v- x i D i f , 
f x + xy = V 
* — 1! 

g(x + z+xz)=> z J 



2 77rere are examples where S ( f , g ) P if i ,g have roots in any one of the shaded regions of Fig- 
ureWU\ 



CHAPTER 1 1 . THE POLYNOMIAL CLOSURE OFP d 



407 



Figure 11.2: Domains of the signed diamond product 



The signed diamond product is the signed diagonal in the product 

f (x + xy )g(x + z + xz) = (L ^ « 4 ) ' (Z ll± jf^ ^ (1L14 " 1) 

For easy of exposition we define f(x,y) € P alt 2 if and only if f (x, — y) e P2. 
The four domains of the lemma lead to the four cases: 

(a) f(xfo + i))eF*2 g((x + iHy + i)-i)eP^ 2 

(b) g(x(y + l))ePf2 f((x + l)(y + l)-l) e P^ 2 

(c) f(x(y + l))eP 2 g((x + l)(y + l)-l)eP 2 

(d) g(x(y + l))eP 2 f((x + l)(y + l)-l) eP 2 

For instance, suppose that (1) holds. Iff € P ( -°°' » thenf(x + xy) = f(x(y + 
1)) S P^j. M gM e P 1 " 00 '- 1 ' theng(x-l) S P'- 00 - ', and hence g(x + z+xz) = 
g((x + l)(z + 1) — 1) G P alt 2- Consequently case (a) above holds. Both of the 
factors of the left hand side of 1)11. 14. Hi are in P alt 2, so the product is as well. 
The signed diagonal of the right hand side of ((11 .14.1b is in P by Lemma ll3.10l 

□ 

Proposition 11.100. J/fxge (p(-l,°°) x P pos ) U (p v ° s x pi- 1 - 00 ') 
then fOg e P. 

Proof. The cases are similar to Lemma [11. 991 Suppose that f e pp° s and g £ 
p (-i,oo)_ Then we see that f (_ x ( y + 1)) g p 2 . As before g(x + z + xz) e P 2; so 

h(x,y,z) = f(-x(y + l))g(x + z + xz) e P 2 . 
The signed diagonal of h(x,y, z) is the diamond product of f and g. □ 
We now consider a different map m : P x P — > P. 

m 

Proposition 11.101. Suppose m(f,g) = f(D)g, flni T(x n ) = H n . Tften f<> g : 
P xP > P. 
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Proof. We will show that f g = (Tf)(2D)g(x) which implies the conclusion 

using Corollary [77151 and Corollary EH First of all, from (H n )' = 2nH n _i it 
follows that 

D k H n =2 k (n) k H n _ k 

and consequently 

J-HD* H n ) = 2 k (n) k x n - k = 2 k D k x n 
By linearity in and x n 

T-^D^g) = (2D) k g 
By linearity in D k it follows that for any polynomial h 

T- 1 (h(D)Tg)=h(2D)g 
and hence choosing h = Tf yields 

T _1 (Tf (D)Tg) = (Tf)(2D)g. 

□ 

If we use the difference operator A(f) = f(x + 1) — f (x) instead of the 
derivative, then we have a similar result, except that the Hermite polynomials 
are replaced with the Charlier polynomials. 

m 

Proposition 11.102. Suppose m(f, g) = f(A)g, and T(x n ) = C". Then f<> g : 
P alt x P ► P. 

m 

Proof. We will show that fOg = (Tf)(A)g(x) which implies the conclusion 
using Corollary l7.15l and Corollary l7.45l Since A(C£) = tiC"_ 1 it follows that 

A k C« = (n) k C«_ k 

and consequently 
By linearity 

T- 1 (h(A)Tg)=h(D)g 
and hence choosing h = Tf yields 

T- 1 Tf(A)Tg = (Tf)(D)g. 

□ 



T- 1 (A k qj) = (n) k x n - k = D k x n 
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A diamond-type product associated with falling and rising factorials has 
elementary proofs of its properties. (But see Question l45l) 

Lemma 11.103. If T: x n i-> (x) n and S: x n ^ (x) n then 
: P Qlt x P alt — ► P alt ro/iere (f x g) = S _1 (Tf x Tg). 

TS TS 

Proof. The proof follows from the diagram 

palt x palt palt palt 

T ^ : multiplication 

V , \ 
palt ^ f palt 

□ 



11.15 Substituting into quadratic forms 

We have seen that if f e pP° s then f(— xy) 6 ±P2- Now xy is the quadratic 
form corresponding to the matrix ( J J ) which is a limit of negative subdefinite 
matrices. Here is a generalization: 

Lemma 11.104. Suppose that the d by d matrix Q is f/ze Zz'wnY of negative subdefinite 
matrices. Iff e P pos f/zen f(-xQx t ) € ±P d . 

Proof. If we write f = YI( X + c i) where all ci are positive, then each factor 
— xQx 1 + Ci of f (— xQx 1 ) is in ±P a , so their product is in it as well. □ 

/ o 1 1 \ 

For instance, if we take Q = y l o l J with associated quadratic form xy + 

xy + zy then Q is a limit of negative subdefinite matrices since Q = J3 — I3. 
Consequently, if f (x) has all negative roots then f ( — (xy + xy + yz)) e ±P3. 

We have seen that if f (x) € P pos then f (— x 2 ) e P. Here's a different gener- 
alization to Pd- 

Lemma 11.105. Suppose that Q is an eby e negative subdefinite matrix. Iff (x) £ 
P^ os (n) f/ienf(-xQx t ,...,-xQx t ) e ±P e (2n). 

Proof. If f = Y. a i x ' then 

g(x) =f(-xQx t ,...,-xQx t ) = ^ a^-xQx^' 1 '. 

Since f € P we know that f (x, . . . , x) has all negative roots. For every cO the 
polynomial — xQx 1 + c 2 satisfies substitution. Consequently, for every root of 
f (x, . . . , x) =0 we substitute for all but one variable, and find two roots. This 
accounts for all 2u roots of g. Since the degree and positivity conditions are 
clearly satisfied for (— l) n g, we see (— l) n g S P e (2n). □ 

The only bilinear forms in P2d are products. 
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Lemma 11.106. If f = Y.t j=i a ij x i Vi G P2d f/xere are non-negative bt, Ci so 

d 

^~ aijXtyj =±(bixi H hb d x d )(ciyi H hc d y d ) 

M=l 

Proo/. We will show that all two by determinants | £j£ a}! | are zero. This im- 
plies that all rows are multiples of one another, and the conclusion follows. It 
suffices to show that | £a | = 0. If we set Xj = y, =0 for j > 2 then 

an xi yi + a i2 xi y 2 + a 2 i x 2 yi + a 22 x 2 y 2 e P 4 . 

If all of an, ai 2 , a 2 i, a 22 are zero then the determinant is zero, so we may 
assume that Qn > 0. By Lemma 111.151 we see that ai 2 , a 2 i, a 22 are non- 
negative. Substituting x 2 = y 2 = 1 and applying Corollary lll.7l shows that 
anQ 22 — ai 2 a 2 i ^ 0. If clu is non-zero then substituting x 2 = yi = 1 and using 
the lemma we conclude that Q 2 iai 2 — ana 22 ^ 0. We get the same inequality 
if a 2 i is non-zero. Consequently, the determinant is zero. □ 

11.16 Simultaneous negative pairs of matrices 

In Lemma [11.1051 we substituted the same quadratic from into a polynomial. 
When can we substitute two different quadratic forms into a polynomial in P 2 
and have the resulting polynomial be in P d ? We need a condition on pairs of 
quadratic forms. 

Definition 11.107. If p(x), q(x) are polynomials with positive leading coeffi- 
cients, then we say that they are a simultaneous negative pair if there is some 
value x such that p(x) ^ and q(x) ^ 0. If Qi and Q 2 are two matrices, 
we say that they are a simultaneous negative pair of matrices if for all vectors of 
constants a the polynomials x a Qix a 1 and x a Q 2 x at are a simultaneous negative 
pair. 

The first question to address is when does {Q, Q} form a simultaneous neg- 
ative pair? 

Lemma 11.108. Suppose that Q has all positive entries. {Q, Q} is a simultaneous 
negative pair iff Q is negative subdefinite. 

Proof. Assume that {Q, Q} is a simultaneous negative pair. We show that 
xQx' — c 2 € P d , which implies that Q is negative subdefinite. Since xQx' — c 2 
has positive homogeneous part, we need to verify substitution. Choose a vec- 
tor a and consider x a Qx a '. For large values of x this goes to infinity. Since there 
is a value of x making x a Qx a ' non-positive, it follows that for fixed a, x a Qx a ' 
takes on all positive values. Consequently x a Qx a ' — c 2 has two real zeros, and 
so substitution is satisfied. 

The converse is similar. □ 
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Proposition 11.109. Suppose that f e P^ s (n). If Q\ and Qiform a simultaneous 
negative pair of matrices then g(x) = f (— xQix 1 , — xC^x 1 ) is in ±P d [2n). 

Proof. Choose a vector of constants a and consider the parametrized curve in 
the plane given by 

6 = {(-x?Q 1 x? t ,-x?Q 2 x? t ) | xi e R} . 

Since Qt has all positive coefficients, the limit of — x^QiXf* as xi — ► oo is 
— oo. Thus, the curve C is eventually in the lower left quadrant. Since Qi, Q2 
are a simultaneous negative pair of matrices we know that 6 meets the upper 
right quadrant. It then follows that 6 meets each of the solution curves of 
f, since the graph of f meets the x axis to the left of the origin, and the y 
axis below the origin. Each solution curve yields two intersections, so we 
find all 2n solutions. As before, ±g(x) satisfies the positivity condition, so 
±g€P d (2n). □ 

Example 11.110. What are conditions on a pair of matrices that make them 
a simultaneous negative pair? Suppose Qi = ( v c t q ) and Q2 = ( ^ t g ) are 
negative subdefinite d by d matrices. 

Choose a vector a of constants. We will find the value Vi of x^Q2Xj t eval- 
uated at the smallest solution to XjQix at = 0, and the value V2 of x*Qix* 1 
evaluated at the smallest solution to x a Q2Xj 1 = 0. If for every a either Vi or 
V2 is negative then Qi, Q 2 form a simultaneous negative pair. This will be 
satisfied if we can show that Vi + V2 or Vi V 2 is never positive. 

Without loss of generality we may scale by positive values, and thus as- 
sume that b = c = 1. 



x?Q lX f = (x 1/ a)( v \v)(?) 

= x\ + 2av t x 1 + aCa* 
x?Q2X? t = (x 1 ,a)( v \-)C;) 

= Xi + 2aw t xi + aBa 1 

To find Vi, we solve for Xi and take the smaller root 



xj = — av t — y (av 1 ) 2 — aCa 1 

Since Qi is negative subdefinite we know xi is real. We then substitute into 
x^Q 2 x^ t . We then do the same for V 2 - Simplification occurs, and 



Vi + V 2 = 2(av t - aw 1 ) ((av 1 - aw 1 ) + v'afw 1 - C)a t - ^/a{ww t - B)a t ) 

Notice that if v = w then Vi + V2 = 0, so Qi, Q2 form a simultaneous 
negative pair in this case. Reconsidering our earlier normalization, this shows 
that if two negative subdefinite matrices have an equal row then they form a 
simultaneous negative pair. 
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Example 11.111. In this example we will show that xy and yz are limits of 
simultaneous negative pairs. Let Qi and Q 2 be the quadratic forms corre- 
sponding to the matrices Mi and M 2 : 



Mi = 




M 2 = 



We will show that Mi and M 2 are a simultaneous negative pair for < e < 1. 
When e = the quadratic forms are 2xy and 2yz. The determinant of Mi and 
of M 2 is (e — l) 2 e 3 (l + e), which is positiveforO < e < 1. Since the eigenvalues 
for e = .1 are (1.1,-0.9,-0.0008), we see that M^ is negative subdefinite for 
< e < 1. Notice that Qi(x,y,z) = Q 2 (z,y,x). 

If we solve for y in Qi, and substitute into Q 2 we get 



e 

(2 + 2 e - e 2 - e 3 - e 4 ) x + (e 2 + e 3 - e 4 ) z + 

2 (1 + e) v /(l-e 2 )x 2 + (e 4 -e 5 )z 2 

We next determine the sign of Vi in the xz plane. First, note that Vi (x, z) = 
has solution x = z. The remaining factor has the form ax + bz + (cx 2 + 
dz 2 ) 1/2 = 0, where a, b, c, d are positive. The solution to such an equation is 
found to be of the form x = ocz and x = (3z, . In our case, it is found that a 
and (3 are positive, a > (3, and a(3 = 1. The solution is only valid for negative 
z. Thus, the sign of Vi (x, z) is as follows. Consider the four rays in clockwise 
order around the origin: 

A: (0,0) - (oo,oo) 

B: (0,0) - (-oo,-aoo) 

C: (0,0) - (-00,-00) 

D: (0,0) - (-oo,-(3oo) 

Then Vi is positive in the sectors AB and CD, and negative in the sectors 
BC and DA. By symmetry, V 2 (x, z) = Vi (z, x), and since a(3 = 1 the roots of the 
complicated factor yield lines with the same slope. Thus V 2 (x, z) is negative in 
the sectors AB and CD, and positive in the sectors BC and DA. Consequently, 
ViV 2 is never positive. 

We conclude that for every value of x, z we can find a value of y for which 
both quadratic forms are non-positive, so Mi and M 2 form a negative simul- 
taneous pair. 
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Corollary 11.112. Ifi[x,y) G (n) then f(-xz,-yz) G ±P 3 (2n]. 

Corollary 11.113. Suppose that the linear transformation x 1 t— * on x 1 maps P v ° s to 
itself. The linear transformation 

xV h-> ai +j xV 

maps Pi^ — ► ?2- 

Proof. If f (x,y) = Y_ QijX^' then by Corollary lll.ll2l we know 

Applying the induced transformation z k ak z k yields that 

^ayt-ir^xVoi+jZ 1 ^ G±P 3 
Substituting z = — 1 shows that 

^~ aijX l u ] ai +j G P 2 

□ 

Corollary 11.114. TTze Knear transformation x T y' i— * ^ X ^.^ maps P^ 05 to ztseZ/ 

Denote by T the transformation of Corollary 111.1141 Here are some conse- 
quences: 

1. If we take f (x,y ) = (x + y + l) n then applying T yields 

ii 



ppos 



2. If we take g G P pos then g(-xy) G ±pP os . If g(-x) = £ btx 1 then 



2_% 2 i)! G ^ 



11.17 The interior of P r 



We have seen (§ 1111. 61 that the polynomials in one variable with all distinct 
roots can be characterized as the interior of P. We can extend these ideas to 
Pd- 

The set of all polynomials of degree at most n in Pd is a vector space V 
of finite dimension, and so has the standard topology on it. A polynomial 
f of degree n is in the interior of Pd, written f G int Pd, if there is an open 
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neighborhood of f in V such that all polynomials in are also in Pd- The 
boundary of Pd is defined to be Pd \ int Pd- 

We say that f < g if there are open neighborhoods of f, g such that if f is in 
the open neighborhood of f, and g is in the open neighborhood of g then f < g. 
Both f and g are necessarily in the interior of Pd- 

Surprisingly products of linear factors are on the boundary. 

n. 

Lemma 11.115. Ifn > 1 and f = ]^[(bk + Xi + a 2 kX2 + • ■ ■ + adkXd) is in Pd 

k=l 

then f zs in the boundary ofY&. 

Proof. We first make a small perturbation so that all the coefficients ay are 
distinct. Consider the perturbation 

TT 

h= (e + (bi +xi + ...)(b 2 +xi + ...)) ^Q(b k +xi + a 2k x 2 H H a d kx d ) 

k=3 

If we set all but one of x 2 , . . . , Xd to zero, then Example l9.23l shows that we can 
choose e such that the first factor does not satisfy substitution. □ 

There is an effective way of determining if a polynomial is in the interior 
of P 2 . It is only practical for polynomials of small degree. In order to do this, 
we must introduce the resultant. 

Definition 11.116. If 

f (x) = a + Qix H V a n x n 

g(x) =b + bix+ hb m x m 

then the resultant of f and g is defined to be the determinant of the n + mby 
n + m matrix 

Q CLi ... Q n 

a ai ... a n 



a ... a n 
b bi ... b m 

b bi ... b m 



bo • ■ ■ b m 

The resultant Rf of f{x,y) and fg(x,t() is a polynomial in one variable. The 
resultant has the property that Rf (a) = if and only if f(a,y) has a double 
root. 

We can use the resultant to show that a polynomial satisfies substitution. 

Lemma 11.117. If f(x,y) satisfies the following two conditions then f satisfies y- 
substitution: 
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• f (a,y) is in P for some a. 

• R f has no real roots. 

Proof. If there is an a for which f(a,y) does not have all real roots, then as 
t goes from cc to a there is a largest value of t for which f(t,y) has all real 
roots. For this value we must have that f (t, y ) has a double root, but this is not 
possible since the resultant R f has no real roots. □ 

We can extend this lemma to the case where Rf has some real roots. The 
proof is similar. 

Lemma 11.118. Suppose f (x,y ) has resultant Rf, and that the distinct roots o/Rf are 
t*i < t 2 < ■ ■ ■ < r s . Ifi satisfies the condition below then f satisfies y-substitution. 

• Set r — — oo and r s+ i = oo. For every ^ i ^ s there is an on e (ri,n + i) 
such that f(cXi,y) e P. 

Lemma 11.119. If f e P2 has the property that the resultant Res(f, f y ) has only 
simple roots then f is in the interior ofF 2 . 

Proof. The resultant is a continuous function of the coefficients, so if g is close 
to f then R g is close to Rf . Consequently we can find an open neighborhood 
of f so that all polynomials in have a resultant with simple roots, and 
exactly as many roots as Rf . Moreover, we may choose small enough that 
each interval satisfies the condition of the previous lemma. We now apply the 
lemma to conclude that every function in satisfies y-substitution. We can 
also choose small enough to insure that all its members satisfy the homo- 
geneity conditions, so that C P2. □ 

Example 11.120. Of course, the resultant must fail to show that linear prod- 
ucts are in the interior, since they are on the boundary. The resultant of 

(x+Q!+yb!) (x+ a 2 +yb 2 ) (x+a 3 +ijb3) 

is 

-bi b 2 b 3 (ai - a 2 + y bi - y b 2 ) 2 (a a - a 3 + y b a - y b 3 ) 2 (a 2 - a 3 + y b 2 - y b 3 ) 2 
and in general 

Resultant rQx+ai + biy =b a ---b n Y[ ( Q i -Qi+y(bj -bf)) 2 

\1=1 / Kj 

There are many values of y for which the resultant is 0, and they are all double 
roots, so the previous lemma doesn't apply. 
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11.18 The cone of interlacing 

Lemma ri.20l in Chapter[T]explicitly describes all the polynomials that interlace 
a given polynomial of one variable. In this section we study the collection of 
all the polynomials that interlace a given polynomial in Pd- This set forms a 
cone, and in some cases we are able to determine its dimension. 

Definition 11.121. Suppose that f e P^. The interlacing cone off is defined to 
be 

CONE(f) = {geP d |f<g}. 

We should first note that if gi, g2 6 CONE(f) and a,b > then g = 
agi + bg2 satisfies f <. g and g e P^., so CONE(f) is closed under positive 
linear combinations. Thus, CONE(f) is indeed a cone. In some cases we can 
explicitly determine the cone of interlacing. See Corollary II 1 .1231 

The dimension of a cone is the least number of elements whose non- 
negative linear combinations span the cone. From Lemma fl 1 . 1 221 we see that 
dim CONE(x n u m ) = 1. 

The first lemma is an analog of the one variable fact that all polynomials 
that interlace x n are of the form cx m . 

Lemma 11.122. Suppose that f G P2 has the property that f and x n y m interlace. If 
f has x-degree at most n, and y-degree at most m then there are a, b, c, d such that 

f = x n " 1 y m " 1 [a + bx + cy + dxy). 

Proof. For any a e R we substitute to find that x n a m and f (x, a) interlace. 
Since the x-degree of f is at most n we know that there are constants r(oc) and 
s(a) depending on a such that 

f(x,et) =r(a)x n + s(a)x n - 1 . 

However, we do not yet know that r(a) and s(<x) are polynomials in oc. If 
we solve f(l, a) = f(2, a) = for r(a) and s(<x) then we see that r(a) and 
s(oc) are rational functions of a. Thus, f (x,y) = x n_1 (r(y) + s(y)x) and so we 
can conclude that x n_1 divides f. Similarly, y 1X1-1 divides f. Since x n - 1 y m - 1 
divides f , the conclusion follows from the degree assumption. □ 

Corollary 11.123. 

CONE(x n y m ) = {ax n ~ 1 y m ~' 1 where a > 0} . 

Proof. The x and y degrees of a polynomial g such that x n y m < g must be n— 1 
and m — 1 respectively. Apply Lemma [il.1221 □ 

The next result is obvious in P. It's worth noting however that the positiv- 
ity condition is not true for polynomials in one variable that do not have all 
real roots. 3 



3 The product of two polynomials with positive and negative coefficients can have all positive 
coefficients: (2x 2 + 3x + 2) {x 2 - x + 2) = 2x 4 + x 3 + 3x 2 + 4x + 4. 
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Theorem 11.124. If fg e P2 then either f, g e P 2 or -f, -g e P 2 . 

Proof. From Lemma [l0.4l we know that f and g satisfy substitution. It remains 
to show that ±f H has all positive coefficients. We know that (fg) H = f H g H , 
and since f g € P2 we can factor 

f H g H = (fg) H =;f[( a i x + b ^ 

i 

where all at, bt are positive. It follows that f H and g H are products of some of 
the terms atx + btu, and the theorem is proved. □ 

The following lemma is harder than it seems. See the short proof in |31 ]. 

Lemma 11.125. Ifi{x,y) is a continuous function defined for all x,y with the prop- 
erty that for any oc e R both f (x, a) and f(a,u) are polynomials then f is itself a 
polynomial. 

We use the last lemma and the factorization property (Theorem 1 1 1 . 124 |l to 
determine what polynomials interlace f n . 

Lemma 11.126. Ifi, g € P 2 and f n <g then there is a gi e P 2 such f <gi and 

g r- V- 

Proof. If we substitute y — a then f(x, oc) n <J g(x, a) so from Lemma [l .201 we 
conclude 

g(x,a) = f(x,a) n_1 gi(x, a) 

where we only know that g(x,y) is a polynomial in x for any value of y. Sim- 
ilarly we get 

gO,y)=f((3,yr- 1 g 2 ((3,y) 
where g 2 (x,u) is a polynomial in y for any fixed x. Now 

g(^y) f s , A 

—r = gi(x,u = g 2 (x,u 

f(x,u) n 1 

so gi satisfies the hypothesis of Lemma [1 1 . 1251 and hence is a polynomial in x 
and y. Finally to see that f < gi note that since f n ^f n_1 gi we know that 

f n -Mf + agi) eP 2 

We now apply Theorem l 11 . 1 24l to conclude that f+agi € P 2 , and so f < g\. □ 

Corollary 11.127. CONE((x + y) n ) = {afx + y)^ 1 | a e »} 

Corollary 11.128. Ifi S P 2 then dim CONE(f) = dim CONE(f k )/or any k > 1. 
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Next, we have an analog of quantitative sign interlacing. 

Theorem 11.129. I/heP 2 ,f(x) = (x-ai) . . . (x-On.) and g(y) = (y-bi)...(y- 
b m ) satisfy f (x)g(y) <h then 

h(x,y = > Cij — (11.18.1 

* — x — Qi y — bi 

y 

ro/zere aZ7 cij arc non-negative. Conversely, ifh is given by ill. 18. li where all cy 
f/ze same sz'gn or are zero then f (x)g(y) < h(x,y). 

Proof. If we fix y then from quantitative sign interlacing (Lemma |1.20| | we can 
write 

h(x,yj = > ai y) 

* — x — ai 

and by holding x fixed 

hx,y = > Pj(x) — 

Substituting y = bj yields 

h(x,b J ) = p J (x)g / (b J ) 



so (3j (x) is a polynomial in x. Next substituting x = and then ij = bj gives 
h(a i; y) = ai(y)f'(ai) 



and so 



h(ai,bj) = ai(bj)f'(ai) 

h(x,bj) g(y) 



ui \ V~ r M*, 



g'(bj) y-bj 

^- n't 



— g'(bj) x- a t y - bj 



h(ai,bj) f(x) g(y) 



_ ^-f'(ai)g'(bj) x- ai y-bj 

We now determine the sign of Cij . The sign of h(ai, bj ) isn't arbitrary for 
we must have that f (x) <C h(x, a) for any a, and thus all the coefficients 

h(a i; bj) g(a) 

si a - " 



f'(Qi)g'(bj) a-bj 
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of must have the same sign. If we evaluate st(x) at a root b k of g we get 



, h cu,b k , h a^bic 

f'(QiJg'(b k ) r(o-i) 

Consequently sgnsi(b k ) = (— l) n+1 sgn(h(ai,b k )) since the sign of f (aO is 
(— l) n+1 . Thus the sign of h(at,b k ) is c k (— l) 1 where c k depends on k. A 
similar argument shows that the sign is also dt(— l) k , so we conclude that the 
sign is e(— 1) 1+ ' where e is a constant. This implies that the sign of 

H(ai,bj) 



f(ai)g'(b,) 

is constant, as desired. The converse follows easily. □ 

Corollary 11.130. Ifall a^bj are distinct, f(x) = (x— ai) . . . (x— a n ),and g(y) = 
(y — bi) . . . (y — b m ) then dim CONE(f (x)g(y)) < nm. 

Proof. The basis for the cone is all the products . □ 

11.19 Products of linear factors 

In this section we look at the interlacing properties of products of linear fac- 
tors. We construct polynomials in Pj 08 whose coefficients are mutually inter- 
lacing polynomials. 

Theorem 11.131. Let pi, . . .,p n and qi, . . ., q n be two sequences of polynomials 
with positive leading coefficients, and define 

n 

f = + qty) = f oM + f iMy + • • • + ^Mv n - 

i 

1. If pi < qifor i = 1, . ..,nthen h<h+ifor i = 1,. . .,n. 

2. If pi < qt in P v ° s fori = l,...,nthen ft < fi+i for i = l,...,n. 

Proof. If Pi < qi then pi + xj q t is in P 2 , and so the product F is in P 2 . The 
coefficients of F are in P, and hence adjacent coefficients interlace. 

In the second case, we know that consecutive coefficients interlace - the 
problem is determining which direction. We prove this by induction. We 
need only to look at the two leading coefficients, so let 



p = cx r + dx r_1 H 

q = ax r + |3x T_1 H 

fi = QiX n + biX^ 1 + ■ 



fi+l = a i+ ix n + b i+ ix 



n-l 
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Consecutive terms of the product (p + yq)F are 

(aai_i +CQi)x n+r + (P at_i + dai + abt_i + cbi)x n+T+1 



(a at + c Qi+i) x n+T + ((3 at + d a i+ i + abt + cbt+i) x 



The determinant of the first two coefficients is 



aat_i + c at |3at_i + den + abi_i + cbt 
aai + cat+i |3at + d ai+i + abi + cbt+i 



which can be written as 



c a 
d p 



at a i+ i 



Qi-l di 
bi-i bi 



at Qi+i 
bi b i+ i 



ac 



ai-i a i+ i 
bi-i b i+a 



The first determinant is positive since p <§; q, the second is Newton's inequal- 
ity, and the last three follow from ft <C f i+i . □ 

Corollary 11.132. Suppose that constants at, bi, Ci, di /xa^e positive determinants 
| Ci dH fln ^ ^ fl ^ aiC ^ zs positive for 1 < i < n. if 

n 

]^[(aiX + bi+ (diX + Ci)y) =f + fiu + ...f n y n 



i=l 



fo<fi< • • ■ <fr 



Proof. Ifwesetpi = diX + Ciandqi = aiX + bi then the relationship PifCqi is 
equivalent to — Ci/di > — bi/di. Since cudi is positive the condition is equiva- 
lent to the positivity of the determinant. Now apply Theorem |11.131| □ 

Corollary 11.133. Suppose that constants C\ and di satisfy 

< di < ci < d 2 < c 2 < ■ • • < d n < c n 

n 

f = n (x ^ + x + + dt) = fo(x) + • ■ ■ + f nMy n 

i 

tten (fg, . . . , f n ) is mutually interlacing. 

Proof. The determinants of the factors are | c £ | = di — Ci > 0, so we can 
apply Corollary lll.l32l to conclude that 

fo<fi< ••• <f n . 

The hypothesis are that f = nt x + di) n( x + c i) = f-n.- It follows that this 
sequence of polynomials is mutually interlacing. □ 
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Remark 11.134. The interlacings in the last three results are strict if we assume 
that the interlacings in the hypothesis are strict. 

What happens in Corollary 19.661 if some of the coefficients of x are zero? 
Theorem 111.1351 shows that we still have interlacing, but some interlacings 
change from < to <. 

Theorem 11.135. Suppose that f e P is a polynomial of degree r and let 

n n+r 
i=l i=0 

Assume that J~[{aiX + hi) has all distinct roots, all at have the same sign e, and write 
f(u)=XciU\ 

• If CoCie > then 

p <Pl< ... «Pn<Pn+l<-.-<Pn+r-l 

• J/coCie < then 

P0>Pl> ■•■ »Pn <Pn+l < ■ • ■ <Pn+r-l 

Proof. Assume e > 0. Both f (y ) and the product are in P2, so all the coefficients 
interlace. It remains to see how they interlace. The first two coefficients are 

Po ^coj^ftatx + bi) 

pi = cipo + c Y_ Y[( aiX + b ^ 

v- Po 

= CiPo + c > —r 

z — aix + b 

j 

v- 1 Po 

= C1P0 + c > ■ 

CLi X + b/Qi 

Assume that CoCje > 0, so Co, Ci ^ 0. By hypothesis po has all distinct roots. 
The coefficient Co/at has the same sign as ci, so pi ^>po- Now apply Corol- 
laryHSl The remaining cases are similar. □ 

11.20 The interlacing of products 

In one variable, every product T~[ (x + ] is interlaced by many different prod- 
ucts. In two variables, the only products J^fx+aty +bt) that are interlaced by 
products are images of one variable polynomials. In particular, most products 
in two variables are not interlaced by any other products. This is actually a 
result about lines in the plane, and the proof requires the fundamental result 
due to Sylvester and Gallai (TJ: 
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In any configuration of n points in the plane, not all on a line, there 
is a line which contains exactly two of them. 

Since this is a property of points, lines, and incidence, we can dualize: 

In any configuration of n lines in the plane, not all parallel nor all 
coincident, there is a point which lies on exactly two of them. 

Theorem 11.136. Suppose that f, g G P2(ti) are products of linear factors, f g has no 
repeated factors, and f, g interlace. Then there are interlacing polynomials h, k e P 
with homogenizations H, K, and constants <x, (3 such that either 



Proof. The graph of f consists of n lines of negative slope in the plane, as 
does the graph of g. By assumption, all these lines are distinct. If the lines 
are all parallel then we get the first case of the conclusion, and if they are all 
coincident then we have the second case. 

If we substitute any value for x then the resulting polynomials interlace, so 
we see that the intersections of the graph G of f g with any vertical line almost 
always consist of 2n points, and these points alternate between being roots of 
f , or roots of g . The same is true for horizontal lines. 

Consider a point (u, v) of the plane that lies on an even number of lines of 
G. By Sylvester-Gallai we know that there is at least one such point. Slightly 
to the left of v a horizontal line meets the lines through (u,v) in points that 
(from the left) are in f, g, f, g, . . . (say), and slightly to the right of v a hori- 
zontal line meets the lines through (u, v) in g, f, g, f, Since a horizontal line 

meets all the lines of G, we get the important conclusion that on the horizontal 
line through (u,v) all the lines of G must intersect, and all such intersections 
have even degree. A similar statement holds for the vertical lines. Figure [TT31 
shows an example of such intersections, where solid lines are factors of f , and 
dashed lines are factors of g . 



Now, take the leftmost such vertical line: this is the line through the in- 
tersection point of G that has an even number of lines through it, and has 
smallest x coordinate. On this line, take the point of intersecting lines with 
the largest y coordinate. Label the intersection (s, t). A line of G has nega- 
tive slope, and can not meet the line y — t to the right of (s,t), and also the 



f(x,y) = h(x + ocy + |3) 
f(x,y) = H(x,«y + p) 



g(x,y) = k(x+ ay + (3) or 
g{x,y) = K(x, ay + (3) 




Figure 11.3: The graph of f g meeting a horizontal line 
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line x = s below (s,t). This implies that all lines pass through (s,t) and this 
contradiction establishes the theorem. □ 

Remark 11.137. Although the lemma shows that there are no interesting inter- 
lacing products of linear terms, there are interesting positive interlacing prod- 
ucts. Suppose that f = Oi^* + a iV + D i) i s a product in P 2 . We claim that 

fjVWj + f(x,y) + ... + f(*/D) 
x + ai + bi x + a 2 + b 2 x + a n + b n ' 

Indeed, each of them interlaces f, so any positive linear combination does as 
well. More generally, if M = I + xDi + yD 2 where Di, D 2 are positive definite 
and M[i] is the ith principle submatrix then 

|M[1]| i |M[2]| i ••• ± |M[n]|. 
11.21 Characterizing transformations 

Just as in the one variable case, we can characterize linear transformations T 
of P 2 that satisfy f < Tf . As expected, T is a linear combination of derivatives. 

Lemma 11.138. If T : P 2 — ► P 2 satisfies f <Tf for all f 6 Pi then there are non- 
negative c, d such that Tf = (c ^ + d^-)f. 

Proof. It suffices to show it holds for polynomials of degree n. Let t\, t 2 be 
positive. Since 

(t 1 x + t 2 u) n <T(t 1 x + t 2 y) n 

there is a constant cx t depending on t\, t 2 such that 

T(tix + t 2 y) n = ot(tix + tiy) n -\ (11.21.1) 

We will determine T by equating coefficients, but first we need to find oc t . 
Since T decreases degree we can write 

T(xV"" 1 ) = OixV 1-1 + btX^V^ 

Thus 

T(tjx + t 2 y ) n = tJ v T(x rt ) + nt^-h 2 Kx^y) +■■■ (11.21.2) 
= ti bnx 11 - 1 + nt^hiian^- 1 + b n ^x n - 2 y) +■■■ 
= at (tj v - 1 x n - 1 + (n - L)t^ 2 t 2 x n - 2 y + • ■ • ) 

Equating coefficients of x n_1 in (|L1 .21.L|) yields 

Y_ Qtitj-^tx^- 1 ) 

= (tibn + nb n _it2)^ ^T^tit^'TCx'y— 
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and conbined with ( (11.21.21 yields 

tib n + nb n _it 2 = 0Ct 

Substituting this into ((11 .21 . lb and equating coefficients of tjt™~ x yields 

TtxV- 1 ) = b n ix^V^ + °- n -i [n - ijxH^ 1 ^ 

= (bn^- + an-ir)"V" 
v ox ay ' 

The conclusion now follows by linearity. □ 

This holds more generally; we sketch the proof. 

Lemma 11.139. If J : Pd — ► Pd satisfies f <Tf for all f e Pj then there are 
non-negative Ct such that Tf = d J^) f 

Proof. As above, 

T (X^) n = a * (X tiXi ) n 1 

Write 

T(xl 1 ---x^)=^a i) „;ix: •••x! 1 • • • xj,-' 

Equating coefficients of x™ -1 yields 

a t = tiQ n o...o + rL ^_ tj Otv— io—i— o 

Finally, equating coefficients uniquely determines T on monomials, which 
completes the proof. □ 

11.22 Orthogonal polynomials in Ph 

There are many different definitions for orthogonal polynomials in several 
variables (see [53 . 166]). We know two infinite families of orthogonal polyno- 
mials in more than one dimension that are also in P^: the Hermite polynomi- 
als (§ 113113.51 and Appell polynomials (this section). 

Knowing that an orthogonal polynomial is in P2 allows us to recover some 
of the one dimensional properties concerning roots. Although such a poly- 
nomial does not factor into linear factors, all of its coefficients (of powers of 
either x or of y) factor into linear factors, and adjacent coefficients interlace. 

We begin by studying Rodrigues' formulas in Pd. In one variable a Ro- 
drigues' formula represents a family of polynomials by an expression of the 
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form ch. 1 D n (hp n ) where h is a function, c is a constant, and p is a polyno- 
mial. Here are three classic examples: 

Laguerre L^"' (x) 



n! e _I x a \dx 

Hermite H n (x) = (-1)^— L ( — 

e~ x " \ dx 

(-l) n 1 



2 n n! (l-x) a (l+x)P 
d 



Jacobi P£' p (x) 

y^j ((l-x) a (l + x)P(l-x 2 )-) 

The next proposition generalizes the Rodrigues' formula for Laguerre 
polynomials to define polynomials in d variables. 

Proposition 11.140. Suppose thatb — (Pi,..., 3d) and I — (ii,...,i d ) are all 
non-negative, and all entries ofa — (cti, . . . , ad) are at least —I. Iff G Pd then the 
following is a polynomial in Pd-' 

— f e - xb x a f N ) 

e -xb x a 9x 7 V I 

Proof. To reduce the necessary notation, let's assume d = 2. We can not just 
apply results about differentiation of polynomials in P2 since the objects we 
are differentiating are not polynomials when the oci are not integers and the 
3i aren't all zero. So, we enlarge our class of objects: 

P(r,s) = {e- plX - p2y x ai x r u a2 u s f | f e P 2 } 

We then are going to show that 

A:P(r,s)^P(r-l,s) (11.22.1) 
ox 

J- :P(r,s) -^P( r ,s-1) (11.22.2) 
9y 

Since 

e- piX - p2V x ai x T y a2 y s f €P(r,s) 

it will follow that 

gn+m 



dx n oy T 



which implies the conclusion. 

It suffices only to prove 1111.22. lb ; the proof of 1111.22. 21 is similar. Differen- 
tiating, 



— (e- piX - p2U x ai x T y a2 y s f) = e _PlX - piV x ai x r_1 y^y s x 
9x v ' 

dV 

-3ixf + (ai +r)f + x— 
ox , 
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Since r is at least 1, ai + r is non-negative, so we know that that — pixf + (cti + 
r)f + xf£ e P 2 , and hence 411.22.11 holds. □ 

There are several varieties of Appell polynomials. We begin with the sim- 
ple, no parameter version, and then introduce more complexity. The version 
below is in JT66J for d — 2. If I = (ii, . . . , id) and x = (xi, . . .,x&) then define 



i a 111 



AlW 4l?( (1 - Ix)|IlxI 



ii!---id!dxj 1 ---3x 1 d d 
Proposition 11.141. Aj(x) zs in Pd 



(1-xi x d ) il+ - +id xi 1 



Proof. Since (1 — I x) is in Pd, the product with x^J 1 • • • x d d is in Pd- Since Pd 
is closed under differentiation, Ai(x) e Pd- □ 

Proposition 11.142. Adjacent Appell polynomials interlace. 

Proof. Without loss of generality, we have to show that 

A tl i d (x)<Ai, i d -i(x). 

Since we can ignore constant factors, we need to show that 

3' 1 



dx\ l ■ ■ ■ 9x d d 



(l-Ix) |I| xi 1 ---x^j < 
glll-l 



( (1 - Ix) 11 '- 1 xS • • x^- 1 

Since differentiation preserves interlacing, we need to show that 

( (i - ixf . . . x 4 d * ) < ( (i - ix) X? • • • X,- 1 

Differentiating and factoring the left side yields 

(l - Ix) 11 '-^ 1 . . .x^- 1 HI|x d + U(l - Ix)) < 



l -ix) 111-1 xS-x^ 1 



^ (x - -A I A, • • ' V ( , 

Since the left side is a multiple of the right side by a linear term that is in Pd, 
the last interlacing holds, and so the proposition is proved. □ 
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The polynomials for d — 1,2 are 
n! dx n 

i an+Tn. 

The one dimensional polynomials are a modified Legendre polynomial. 
The first few of the one and two dimensional polynomials are listed in Chap- 
ter 124.91 The Appell polynomials of different dimensions are related since 
Ai, i d (xi, ...,%&) = Aij id/ o (xi, . . . , x d/ 0) . 

Next we consider Appell polynomials with extra parameters (|166|). Let 
a = [oti, . . . , otd), and define 



-OLi v -a d 



ii!---id! 9xj 1 ---3Xi d 

(1-xi x d ) ilH 



Proposition 11.143. If on ^ -1/or 1 i d Af (x) e Pd 

Proof. Since (1 — II) 111 x 1 e P^, the result follows from Proposition 111. 1401 with 
b =0. □ 

If we attempt to generalize the Rodrigues' formula for the Laguerre poly- 
nomials with the definition 

1 am 

' ,-bx„a„I\ 



xbv-a 



x a 3x 



XX 



then it is easy to see that this factors into a product of one dimensional La- 
guerre polynomials. We use Lx to force the Xi's to be dependent. The follow- 
ing polynomials are in Pd. 

e -xb x a 9x I V J 

Here is a two dimensional example that is (28) from 11661 Page 159]. Since 
xy — 1 e P 2/ the following is in P2 for b > 0: 

-b(x-Hl) 5 m (e- b(x+ ^Uu-l)-+-) 

The Rodrigues' formula for the Legendre polynomials is 

"•M-Sqafs)"'**- 11 " 



CHAPTER 1 1 . THE POLYNOMIAL CLOSURE OFP d 



428 



We can give a Rodrigues' formula for an analog of Legendre polynomials, but 
the resulting family does not appear to have any recursive properties. Let Q 
be a negative subdefinite symmetric d by d matrix, and let x = (x\, . . .,Xd]. 
For any index set I we define 

Pi(x) = !^ (xQxt-l)!'! 
ox 1 

Since Q is negative subdefinite, the quadratic form xQx 1 — 1 is in Pd, and 
since multiplication and differentiation preserve Pd, it follows that Pi(x) £ Fa- 
Remark 11.144. There is a different definition of Appell polynomials in ||53l 
that does not lead to polynomials in Pd- In the case that d = 2 these polyno- 
mials are, up to a constant factor, defined to be 

lWx,u) = (1 - x 2 - J£j^ (1 - x 2 - vT +W - 1/2 

where [i is a parameter. Since 

U 2/0 (x,u) = - ((3 + 2 v) (1 - (2 + 2 |i) x 2 - y 2 )) 

it is clear that U2,o(x,y) does not satisfy substitution, and so is not in P2. 

When we generalize the classical orthogonal polynomials, there are many 
choices and no one way of doing it. We can notice that these generalized 
Legendre polynomials are similar to the definition of Appell polynomial in 
l53l , except that he uses a positive definite matrix (the identity) leading to the 
quadratic form (in two variables) x 2 + y 2 — 1. Replacing the identity by a nega- 
tive subdefinite matrix leads to a definition of generalized Appell polynomials 
as 

— — -9 1 (xQx 1 - 1) |I| + ^ 

(xQxt-l)* 1 

where \x is a parameter. That this polynomial is in P^ follows from the follow- 
ing theorem, whose proof is similar to Proposition II 1 . 140l and omitted. 

Proposition 11.145. Ifi £ Pd, M- any parameter, I any index set, then 

1 9 11 ' , 

f ^ 9x^ fc d 



11.23 Arrays of interlacing polynomials 

The polynomials determined by Rodrigues' type formulas form interlacing 
arrays. For instance, consider the polynomials 



CHAPTER 1 1 . THE POLYNOMIAL CLOSURE OFP d 



429 



where g e P2 is of degree 2. Since §f is a polynomial of degree 1 in P 2 we 
know that 

n+m-1 ^9 n+m-1 

9 9x- 9 



and therefore 



an+m-l pt atv+m-1 
_^ ^„n+m < _° „n+m-l 



ax n - x 9y m dx — ax n - J 9y m ' 



which implies that f n , m < f n _i, m . These interlacing take the following form, 
where h < — k stands for h < k: 



f 0,2 

A 



f 1,2 :: - f2,2 

A 



fo,l > f 1,1 > f2,l 

A A 



f 0,0 >■ f 1,0 > ^2,0 

We consider two examples for g that are of the form xQx* — 1 where Q is 
negative subdefinite. 

Example 11.146. First, take g = xy — 1. In this case we have an explicit formula 
for f n , m : 



fn,n 



9x n 9y m 



(xy-D 



n+m 



n+m 



i=max n,m 



n + m 



n+m 

z — V 1 — n, i — my 



= (n + m)! 

i=max(n,m) 

Using this formula we can verify that f n , m satisfies a simple two term re- 
currence relation 



fn,m = - ( (n + m) 2 y f n -i, m - rn(n + m) (n + m - l)f n -i,m-i ) 

If we substitute y = 1 we get an array of one variable polynomials such 
that every polynomial interlaces its neighbors. 
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x 2 x 2 6 x 3 

-2 + 4x -12x+ 18x 2 -72x 2 + 96x 3 

-12 + 18* 24-144x+144x 2 360 x - 1440 x 2 + 1200 x 3 

-72 + 96x 360 - 1440 x+ 1200 x 2 -720 + 8640 x - 21600 x 2 + 14400 x 3 

Example 11.147. Next, we consider the quadratic form corresponding to the 
negative subdefinite matrix ( 2 ? ) • If we define 

f n m = JL— (x 2 + y 2 + 4xi) - l) n+m 

' 3x n 9y m 

then we again get an array of interlacing polynomials. The degree of f n , m is 
n + m. In this case if we substitute y — then we get an array of interlacing 
one variable polynomials, but there does not appear to be any nice recurrence 
relation. 

1 4x -4 + 36x 2 

2x -8 + 24x 2 -216x + 408x 3 

-4 + 12 x 2 - 144 x + 240 x 3 432 - 4896 x 2 + 6000 x 4 

-72x+120x 3 288- 2880 x 2 + 3360 x 4 24480 x - 120000 x 3 + 110880 x 5 



Chapter 



Polynomials satisfying partial 
substitution 

In this chapter we are interested in polynomials in two kinds of variables that 
satisfy substitution only for positive values of the variables of the second kind. 
Such polynomials naturally arise from transformations that map P pos to P. 

12.1 Polynomials satisfying partial substitution in two 
variables 

We introduce the class P\,\ of polynomials f (x,y) that satisfy substitution for 
positive y. 

Definition 12.1. The class S\i consists of all polynomials f [x,y) satisfying 

1. The homogeneous part f H is in P pos . 

2. f(x,(3) e P for all f3 ^ 0. 

If all the coefficients are non-zero, then f 6 ^ P ° s . The polynomials in the 
closure of 8\i are and those in the closure of P^° s are S > \ 1 . 

We have the inclusion P2 C P-y ;1 . The next example shows that Px,l is n °t 
closed under differentiation, and so the containment is strict. 

Example 12.2. tP\ t \ is not closed under differentiation with respect to y. How- 
ever, if f G 8\j (n) then there is an a ^ such that (x, y + oc) e t?i^[n). 



431 



CHAPTER 12. POLYNOMIALS SATISFYING PARTIAL SUBSTITUTION A32 



The calculations below show that f e but that |i ^ The calcula- 
tions do show that |i(x,y + a) e P^i for a ^ 1/8. 



Every vertical line x = (3 to the left of the smallest root of f (x, 0) has n — 1 
roots of f^-(P,u). Since these n — 1 solution curves are asymptotic to lines 
whose slopes lie between the slopes of the asymptotes of f , every sufficiently 
positive horizontal line will meet all n — 1 curves. 

What do the graphs of polynomials in S\i look like? The solution curves 
do not necessarily go from the upper left quadrant to the lower right quadrant 
as is the case for polynomials is V 2 , but they can loop back to the upper left 
quadrant. 



Figure 12.1: L 5 (x;y ) and its derivative (dashed) with respect to y 

In general, since f H has all negative roots, the asymptotes of f are lines 
in the upper left quadrant with negative slopes. As x — > — oo, the solution 
curves eventually have positive y -coordinates. Thus, there is an a.\ such that 
f(a,y) e P alt for all a < <x\, and also an oco such that f(a,y) e P for all 



f 



(-3 + x + y) 2 (18 - 19x + 5x 2 - 22y + lOxy + 5y 2 ) 
2 (-3 + x + y) (51 -45x + 10x 2 -48y +20xy + 10y 2 ) 



roots of f v : 



roots of f : 
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oc < ocq. In Figure HZTI ccn is about .1, and <x\ is about —12.6. A formal proof of 
this general fact follows the argument of Theorem l9.27l 

12.2 Examples 

Polynomials satisfying partial substitution arise naturally in several ways. 

Example 12.3. The generalized Lag uerre polynomials L n (x; y ) are in P\ t \ since 
their homogeneous part is ^(x + y) n , and they they satisfy the three term 
recurrence 11681 (5.1.10)] 

nl_ n (x;y) = (-x + 2n + y - l)L n _i(x;y) - (n + y - l)L n -2(x;u) 
Lo(x;y)=l = x + y + 1 

The following lemma generalizes the fact that three term recurrences de- 
termine orthogonal polynomials. 

Lemma 12.4. Suppose that m, bj., dt, e\ are positive for i ^ 0. If 

fn(x;y) = (a n x + b n y + c n )f n _i(x;y) - (d n y + e n )f n _ 2 (x;y) 
fo(x;y) = 1 f-i(x;y) = 

then f n (x; y ) e P\ t \ fom — 0, 1, 

Proof. The construction yields polynomials that satisfy the degree require- 
ments, and the homogeneous part is Oi( a i x + bill)- If V is positive then the 
recurrence defines a three term recurrence for orthogonal polynomials, so they 
are in P. □ 

Note that if a is sufficiently negative, then f n (x; a) satisfies a recurrence 
that does not define a sequence of orthogonal polynomials, so we do not ex- 
pect any polynomials from this construction to lie in P2. 

Example 12.5. We can construct two variable Charlier polynomials from the 
one variable polynomials by 

C n (x;y) = (-irqt(-x). 

They also satisfy a recurrence relation C_i(x;y) = 0, Co(x;y) =1 and 

C n +i(x;y) = (x + y +n) C n (x;y) -ny C n _i(x;y). 

It follows from the Lemma that C n (x;y ) € ®\,\- Fi gure [T2 .2 1 shows Cs(x;y). 

If f e P pos then f (— xy) 6 P2, and consequently is in However, more 
is true: 

Lemma 12.6. J/f € P, then f (xy) e ^1,1. iff G P pos , ften f (xy) G 
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Proof. Since ^1,1 is closed under multiplication, the comment above shows 
that it suffices to show that 1 + xy e S\i . Consider f e (x, y ) — 1 + (x + ey ) (y + 
ex). For positive e the graph of f e is a hyperbola, and the minimum positive 
value is found to be d e = £t§z\ ■ Since the asymptotes have slope — e and 
— 1/e, the polynomial f e — d e meets every horizontal line above the x-axis in 
two points, and the degree and homogeneity conditions are met, so f e — d e G 
Since lim e ^o(fe — d e ) = f it follows that f e ®\,\- □ 

Example 12.7. We can construct polynomials that satisfy partial substitution 
but not the homogeneity condition in a manner that is similar to the construc- 
tion of polynomials in Suppose that A is a positive definite n by n ma- 
trix, and B is a symmetric matrix that has positive and negative eigenvalues. 
Define f(x,y) = I + yA + xB|. If y is positive then I +yA is positive def- 
inite, and so all roots of f(x,y) are real. The homogeneous part of f(x,y) is 
|yA + xB|. Since A is positive definite, the roots of f H (x, 1) are the roots of 
1 1 + xA _1 / 2 BA _1 / 2 |. Since B has positive and negative eigenvalues, Sylvester's 
law of inertia shows that A^^BA -1 / 2 does also. Consequently, the roots of 
f H are not all the same sign, and thus f ^ 

Consider a concrete example. A was chosen to be a random positive def- 
inite matrix, C was a random symmetric matrix, D was the diagonal matrix 
with diagonal (1,1,1,-1), and B = CDC. If f(x,y) = |I + yA + xB|, then 
Figure \12 .31 shows that f (x,y) ^ P2. (The dashed line is the x-axis.) 





Figure 12.2: The Charlier polynomial Cs(x;y) 
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f (x, y ) = 1 + 11 .42 x + 4.84 x 2 - 7.13 x 3 - 3.67 x 4 + 15.48 y 

+ 42.56 xy - 9.60 x 2 y- 21.88 x 3 y + 12.05 y 2 + 13.80 xy 2 
- 5.94 x 2 y 2 + 1.99y 3 + 0.96 xy 3 + 0.07y 4 




Figure 12.3: A polynomial of the form |I + yA + xB 

We summarize these observations in a lemma. The last two parts are 
proved in the same way as the results for P 2 . 

Lemma 12.8. If A is positive definite, and B is symmetric with positive and negative 
eigenvalues then f(x,y) = |I +yA + xB| satisfies 

1. f (x,y ) has asymptotes of positive and negative slope. 

2. f (x, a) e Pfor a ^ 0. 

3. f (a,y) e Pfor all a. 

4. f(ax,x) e Pfor all a. 

12.3 Induced transformations 

We now look at induced linear transformations on P pos . 

Lemma 12.9. If T: P v ° s — ► pP os f j preserves degree and the sign of the leading 

coefficient, then T* : &\T — -> & XT ■ J / T: pP ° S — * P then J * : — » 

Proof. The hypotheses imply that T*f satisfies homogeneity assumption. We 
next verify substitution for y . For |3 > we have 

(T,f(x ( y))(x,p)=T(f(x,p))eF os 

since f(x, 8) e P pos , and consequently T*(f) e ^ p ° s . The second part is simi- 
lar. ' □ 
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We can easily determine the slopes of the asymptotes of T*f . Suppose that 
f = Y. 0Li,i xt y'/ an d Ci is the leading coefficient of T(x x ). Then 

(T*f) H (x,l) = (X <H,jT(xV) H (x,D = Y. ^n-iCtX 1 

The slopes of the asymptotes are the roots of this last polynomial. 

Corollary 12.10. Suppose!: p pos — ► pp° s / J preserves degree and the sign of the 
leading coefficient. Iff e P pos then T*f(x + y) e P P ° S . 

Proof. Since f (x + y] 6 P p ° s we can apply Lemma [l2.9l □ 

Lemma 12.11. Choose a positive integer n, and define the linear transformation 
S: x 1 i—* T(x x )y n ~ x . IfT: p pos — ► pp°s preserves degree and the sign of the 
leading coefficient then S: P v ° s — ► S>\ '° s . 

Proof. If f € P pos then the homogenization F of f is in tP\ t \. Since S(f) = T*(F) 
we can apply Lemma [12. 9 1 to conclude that T*(F) S &\° s . □ 

Corollary 12.12. Iff e P pos (n) and S(x l ) = (x}^ - * then S(f) G P p ,° s . 

Proof. The map x 1 1— > (x) t satisfies the hypotheses of Lemma [12. 141 □ 

Example 12.13. If we choose f = (x — l) n then from the corollary 

i=0 v 7 i=0 v 

The last polynomial is in ^ p ° s . 

Lemma 12.14. IfT: p pos — ► pp° s efe/me a Zmear transformation S(x n ) = 

y n [T(x n )(x/y)] tfiew S: P pos — ► 8> u . 

Proof, if f (x) = £ QiX 1 is in P pos then 

TT 

S(f)(x,a) = ^a i a i [T(x i )(x/a)] 

k=0 

= T(f(«x))(x/a) 

The last expression is in P pos for positive a. Finally S(f ) H is the homogeniza- 
tion of T(x n ), and has all positive coefficients since T maps to P pos . □ 
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12.4 Applications to transformations 

The graph of a polynomial in P^i goes up and to the left. If we intersect it with 
an appropriate curve we will find sufficiently many intersections that deter- 
mine a polynomial in P. Using these techniques we can establish mapping 
properties of transformations. 

Lemma 12.15. J/f £ f(a,y) £ P for a sC 0, and f(0,y) £ P pos (n) f/zen 
f(x,x) € P pos . 

Proo/. We will show that the graph of f intersects the line y = x in n points 
that have negative x coordinate. We know that the graph of f has asymptotes 
that have negative slope, so f is eventually in the third quadrant. However, 
the assumption that f (x, 0) £ pP os means that all solution curves begin on the 
negative y axis. Consequently, all the curves must intersect the line y = x. See 
Figure El 




Figure 12.4: The intersection of f (x,y ) = and y = x. 



□ 

Lemma 12.16. Suppose that T: p pos — > p pos , where T preserves degree, and 
T(x x ) has leading coefficient a. Choose f £ p vos (n), and define S(x x ) — T(x 1 )x n_1 . 
IfL CiCkx 1 € P'- 00 -- 1 ' where f = £ c^x 1 f/zen S(f) € P pos . 

Proof. Since f e pT 108 we know that the homogenization F of f is in 8\i. From 
Lemma 112.91 the induced map T* satisfies T*(F) £ By hypothesis, the 

asymptotes all have slope less than — 1. Each solution curve meets the x-axis 
in (-oo,0) since (T*(F))(x,0) = T(x n ) £ P pos . Consequently, the graph of 
T*F meets the line x = — y in n points, and these are the roots of S(f). See 
Figure [1231 

□ 

Corollary 12.17. The linear transformation , V : x 1 i-> (x)i.x n_1 - maps p' - 1 ' to 
P pos . Similarly, U : x 1 ^ (x)^ 11 - 1 maps P (ai) (n) to P alt . 
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Figure 12.5: The intersection of (F) and y 



— — x. 



Proof. If we define T(x r ) = (x) i and F is the homogenization of f then 
T*(F)(x,x) =V(f). Iff S P (ai) (n) thenF e P 2 (n) and so from Corollary MM 
we know that T*(F) e Since (T*(F))(x,0) = (x) n the graph of T*(F) meets 
the u axis below the line x = —y. Since the leading coefficient of (x) i is 1 the 
asymptotes of T* (F) are the roots of f, which are between and 1. Thus T*(F) 
is asymptotic to lines that lie below x = — y, and so every solution curve of 
T*(F) meets x = —y above the x axis. See Figure[lZ6l □ 



Lemma 12.18. If J: P v ° s — ► P preserves degree, T(x n ) is monic, a > 0, and we 
define V(x n ) = x n [T(x n )(-a/x)] thenV : p(- Q <°> — >p. 

Proof. If we define S(x n ) = y n [(Tx n )(x/y)] then V(f) = (Sf)(-a,x). Since 
T(x n ) is monic we find that (Sf)(x,0) = f(x), so the roots of (Sf)(x,0) are 
exactly the roots of f. Consequently, the vertical line through —a is to the 
left of the intersection of the graph of Tf with the x-axis, and so meets all the 
solution curves since Sf G ^1,1 by Lemma [12. 141 □ 

Corollary 12.19. J/V : x n ^ -(* - l)(2x - 1) • • • ((n - l)x - 1) then 
y . p(-i,o) > p(o,i) r 




Figure 12.6: The intersection of T* (F) and y 



— — x. 
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The transformation x n i— > (x + l)(2x + 1) • • • ((n — l)x + 1) maps 
p«U) > p(-W)_ 

Proof. We know V : p' -1 - ' — ► p by applying the Lemma to T: x n i— > (x) n . To 
see that V has range P 10 ' 1 ' we consider the intersection of the graph with the 
line x = — y. If f = ciiX 1 and S is as in the Lemma then 

S(x n ) = (x + u)(x + 2u)---(x + uu) 

so (Sf)(x, — x) = do + aix. Since f e pv os / q and Qi are positive, and so the 
graph of Vf does meets the line x = — y once in the upper left quadrant. 

The homogeneous part of S(f ) is x(x + y)(x + 2y) • ■ ■ (x + (n — l)y), so the 
graph of S(f ) has asymptotes with slopes 1, 1/2, • ■ • , l/(n— 1), and one vertical 
asymptote. For positive y the graph of S(f ) consists of n curves starting at the 
roots of f . The vertical asymptote begins below y — — x and so meets the line 
y — —x. Thus, the other curves lie entirely below y — — x, and consequently 
the line x = — 1 meets these solution curves beneath the line y — — x, and 
hence all the roots of V are in (0, 1). 

Replace x by — x for the second statement. □ 

For example, consider Figure H2.71 the graph of Sf, where f (x) is the poly- 
nomial (x + .3) 3 (x + .9) 3 . The two dots are are at (—.9,0) and (—.3,0) which 
correspond to the roots of f . Except for the vertical asymptote, the graph of Sf 
lies under the line x = —y. The dashed line x = — 1 meets the graph in points 
whose y coordinates lie in (0, 1). 




Figure 12.7: An example for Corollary ll2.19l 
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12.5 Mobius transformations 

We use properties of the graph of T*f(xy) to establish properties of Mobius 
transformations acting on linear transformations. We know from Corol- 
lary |9]57| that if T maps P pos to itself then so does Tj/ Z . It is more complicated 
when the domain is not P pos . 

The graph of T*f(xy) is different from the graphs of Figure H2.1l and Fig- 
ure [1231 since there are vertical and horizontal asymptotes. Figure \12. 81 is the 
graph of T* (f (xy )) where T(x l ) = ( x + l ) i and f e P pos has degree 3. 



Figure 12.8: A graph of T» (f (xy )) where f e P pos 



Lemma 12.20. Let T : P po s — > pv° s -where j preserves degree, and d is the leading 
coefficient of T(x l ). For any positive integer n and f = Y_ a O<- x !n P pos (n) 

2. (T„f(xy))(x,P) eP vos forall |3 > 0. 

2. TJ(xy)eP u 

3. The vertical asymptotes o/T*f (xy ) occur at the roots o/T(x n ). 

4. The horizontal asymptotes are of the form xy = s, where s is a root ofY_ aiCi.x\ 
Proof. The first part follows from 

T*(f(xy))(x, (3) = Y_ aiTV)^ = Y_ atT(pV) = T(f((3x)). 

As the coefficient of y n is T(x n ) there are vertical asymptotes at the roots of 
Tfx™). We know that f (xy ) s &\° s so T*f (x,y ) £ &\° s . The horizontal asymp- 
totes are governed by the terms with equal x and y degrees. The polynomial so 
determined is Y_ a iCix 1 y 1 , which establishes the last part. Lemma [2.41 shows 
that all the Ct have the same sign, so s is negative. □ 
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If we assume that f e P alt then Figure [LTHl is reflected around the y axis. 
Figure 112.91 shows the hyperbola and the intersection points with the graph 
that give the roots in the next lemma. 

Corollary 12.21. Let T: P alt — > P alt where T preserves degree, and d is the 
leading coefficient ofT[x l ). For any f = Y a i x% ' n P aU suc ^ that Y Ciatx 1 - is in 
P (ai) the polynomial T 1/z (f) is in P alt . 

Proof. As usual let T*[x l yi) = T(x x )yK If we define T[x l ) — pi then 

Ti/Jf) = X atx^l/x) = X at[T,(xV)(Vx,x)] = T*(f(xy)) (l/x,x) 

Thus, the roots of T\/ z are found at the intersection of T*(f(xy)) and the hy- 
perbola xy — 1. As long as the solution curves of T*(f(xy)) go to zero 
more rapidly than 1/x then xy = 1 will intersect every solution curve. By 
Lemma 112.201 the horizontal asymptotes are of the form xy = s where s is a 
root of Y- CiQiX 1 . By assumption < s < 1, and hence Ti/ S (f) has all real 
roots. □ 




Figure 12.9: The roots of T 1/z (f) for f eP" 



Similarly we have 

Corollary 12.22. Let T: P v ° s — ► pp° s where T preserves degree, and ct z's the 

leading coefficient of T(x l ). For any f = Y CM* 1 zn P pos such that Y_ Ciatx 1 - z's in 
P { - m the polynomial T_ 1/z (f) isinP v ° s . 

Here is another consequence of changing signs. 

Corollary 12.23. Let T: p pos — ► p alt where T preserves degree, and ct z's the 

leading coefficient of T(x l ). For any f = Y a i xl ZM P alt suc ^ that Y c i a i(~ x ) 1 zs 
z'zz P m) the polynomial T 1/z (f) is in P alt . 
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Proof. If we let S (x) = T(— x) then S satisfies the hypothesis of Corollary ll2.22l 
The conclusion follows since S_y z — T\/ z . □ 

We can apply Corollary ll2.21l to the monic Laguerre and Charlier polyno- 
mials (Corollary l7.47l and Corollary [735) since all the Ci are 1. 

Corollary 12.24. If L n is the monic Laguerre polynomial then the transformation 
x i i-> Lt(a;x) REV maps P (ai) to P. 

Corollary 12.25. If C" is the Charlier polynomial then the transformation x 1 i— > 
Cf REV mapsP m] to P. 

In the falling factorial the constant terms of the factors changed. Now we 
let the coefficient of x change. Consider the linear transformation 

T(x n ) = (1 - x)(l - 2x)(l - 3x) • ■ • (1 - nx) (12.5.1) 

Lemma 12.26. The linear transformation ( |12.5.H maps p' - 11 to P. 

Proof. If S(x n ) = ( x — 1 )„ then T = S\/ x . The result now follows from Corol- 
lary IT2T91 □ 

Next we consider the Mobius transformation z — ► (1 — z). 

Lemma 12.27. Suppose that T: P pos — > P, and let f(x) = £ cux* € P pos (n). 
Define a to be the leading coefficient o/Tfx 1 ). We can conclude that ^ at(l — 
x) n ~ x T{x x ) G P any of the following hold: 

1. T(x n ) e p(-°°-D and £ HCiX 1 e p'- 00 -- 1 ). 

2. T(x n ) e P (1 '°°' and £ G P (_1 ' 0) . 

3. T(x n ) e pi- 00 - 1 ' and £ atCiX 1 G P alt . 

Proof. We need to show that each solution curve of T*(f) meets the line x + 
y — 1. Since T: p pos — > p there are solution curves for all positive y. The 
asymptotes of T*(f) are the roots of Y. o.iC\x x , and T*(f)(x,0) = T(x n ) so the 
geometry of the solution curves is given in Figure 112.101 It is clear that each 
solution curve must meet the line x + y = 1. □ 



12.6 Partial substitution in several variables 

We now generalize S\i to more variables. In short, we consider polynomials 
in two kinds of variables such that if we substitute arbitrary values for all but 
one of the x variables, and only non-negative values for all the y variables, 
then the resulting polynomial is in P. In more detail, 

Definition 12.28. The class ^ a,e consists of all polynomials f (x;y) where x = 
(xj,. . .,x d ) andy = (uj . ..,y e ) satisfying 
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l l l 

Figure 12.10: The geometry of the solution curves of Lemma [12.271 



1. The total degree equals the degree of each variable. 

2. The homogeneous part f H is in Pa+e- 

3. f(x; (3) e Pd where all |3 = (|3i, . . ., |3 e ) are non-negative. 

We define ® to be the set of all polynomials that are the limit of polynomials 
in ^d,e- The set of all polynomials in P d>e with all positive coefficients is P^° e s . 

Notice that ^ g S consists of polynomials in P d with all non-negative coef- 
ficients, and is exactly Pd+ e - We have the inclusions 

ppos _ npos r opos r r npos 

1 d+e — d+e,0 i= d+e-1,1 i= ' ' ' i= °l,d+e-l 

If d, e > 1 then substitution of positive values determines two maps 

^d° S — * ^d-i e (substituting for an x value) 
^d° S — > Te-l (substituting for a y value) 

IP d,e and ^d° e s are closed under multiplication. We can not define inter- 
lacing in ^d°e S as closed under all linear combinations, since we don't allow 
negative coefficients. If f , g e s we define f ~ g to mean that f + ocg G ^d° S 
for all non-negative a. For example, if I is linear and in ^d° s , and f € ^d° S / 
then li i f . 

A linear polynomial c+Y_ cuxi+^ b^Ui is in ^d,e if and only if it is in Pd+e- 
The quadratic case is much more interesting, and is considered in Section [l2.7[ 

It is easy to construct polynomials in &d,e by generalizing the inductive 
construction of Lemma [12.41 The proof is straightforward and omitted. 

Lemma 12.29. Suppose that at, bi, dt, ei are vectors of positive constants for i ^ 0, 
and that a can be he positive or negative. If 

f n (x;y) = (a n -x + bn ■ y + c n )fn-i(x;y) - [d n • y + e n )fn-2(x;y) 
f (x;y)=l fi(x;y) =aj -x + bi -y + ci 
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then f n (x;y) e &d,efor n — 0,1, — 
12.7 Quadratic forms 

Our goal in this section is to determine conditions for the quadratic form 
(x y) Q (x y) 1 to belong to &d,e- We first consider the case when d = 1, 
and then when e = 1. 

A matrix Q is called copositive if xQx* ^ for every vector x with all non- 
negative entries. Unlike positive definite matrices, there is no good character- 
ization of copositive matrices. 

Lemma 12.30. Suppose that Q = ( v a t £ ), where C is symmetric and a is positive. 
The quadratic form f (x;y) = (x,y)Q(x,y) t has all real roots for all non-negative y 
if the matrix v t v — aC is copositive. 

Proof. Expanding the quadratic form yields 

(*,y) ( v Q t c) (yt) =ax 2 + x(yv t +vy t )+yCy t 
Since yv 1 = vy*, the discriminant of the equation is y(v t v — aC)y t . 

□ 

Lemma 12.31. Suppose that Q = ^ where all entries are positive, A 

is a A by A symmetric matrix, and c is a scalar. The quadratic form f(x;y) 
( x y) Q (x y) t is in S>d,\ tffQ * s negative subdefinite. 

Proof. If Q is negative subdefinite then f(x,y) e Pd+i, and since all terms are 
positive it is in P d ,i- 

For the converse, we only need to show that f(x;y) e Pd+i, since this 
implies that Q is negative subdefinite. Since all entries of Q are positive, f (x,y ) 
satisfies the degree condition. Since f (x,y) H = f(x,y), it suffices to show that 
f (x, y ) satisfies substitution for any choice of values for x and y . By hypothesis, 
f(x, 1) e Pa- Now since f(x;z) = z 2 f(|,l) and f(|;l) satisfies substitution, it 
follows that f (x; z) satisfies substitution. □ 

Proposition 12.32. Suppose that Q = ^\ ^ where all entries are positive, A is 
a d by d symmetric matrix, and C is aneby e symmetric matrix. The quadratic form 



(* y)Q(; t 



is in Pd,e zjff the two conditions are met: 

1. A is negative subdefinite. 

2. uA^u* — C is copositive. 
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Proof. If the quadratic form is in Pd,e then substituting y = shows that 
xAx 1 6 P d, so A is negative subdefinite. 

If we substitute non-negative values p for y, expand and regroup the 
quadratic form, we get 

(x (3) ^ C£\ = xAx t + |3u t x t + xup* + PCP* 



,Z')(t) 027.1) 



By Lemma [12,311 f(x,z) is in P d+ i iff ( ^ ™£* t j is negative subdefinite. By 
Lemma [l0.78l this is the case iff 

< (up t )(A _1 )(Pu t ) - (PCP 1 ) = P(uA~V - OP 1 
which finishes the proof. 

□ 

Remark 12.33. Here is an alternative derivation in the case that A is 2 by 2. We 
only need to determine when the quadratic form satisfies substitution. This 
will be the case iff the discriminant, with respect to the variable X\, is positive. 
This discriminant is a quadratic form, and its discriminant, with respect to x 2 , 
equals 

a|A|y(uA-V - C)y 

The condition that this is non-positive is equivalent to the positivity condition 
since A is negative subdefinite, and therefore |A| is negative. 

Remark 12.34. Let's work out an example in light of this theory. We start with 
g(x;y) = x 2 + 2xy + y 2 + 4x + 3y + 2 

and the homogenized form is 

G(x;y,z) = x 2 + 2xy + y 2 + 4xz + 3yz + 2z 2 
= {x,y,z) (\ \ *%) {x,y,z) x 

\2 3/2 2 / 

To show that this is in S\ 2 we note A = (1), u = (1,2), C = ( y 2 3 {f J 

uA 1 u t - C = (11) - ( 3 ) 2 3 2 2 ) = (1/2 ^ 

which is copositive. Substituting z—1 shows g(x;y) e P\ t \. 

Remark 12.35. A quadratic form in P\ r \ is in Pa. To see this, let Q = (be)' anc ^ 
assume (xy )Q(xy) t £ Applying Proposition ll2.32l we see that b 2 /a — c > 
or equivalently | g jj | < 0, and hence Q is negative subdefinite. The example 
in the previous remark is a quadratic polynomial in P\ t \ \ P 2 , but it comes from 
a quadratic from three variables. 
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Lemma 12.36. Xfyi + • • - x d y d e &d,d 



Proof. We will give the computations for d — 2, and describe the general con- 
struction. If we define 



Qe = 



e 
1 



e 
1 



1 e \ 

e 1 

a oc 

ex. Oi) 



then (xi,X2 ; ui,U2)Q e (xi / X2 ; ui,'y2) t converges to xryi + x 2 y 2 - It suffices to 
show that u e A e u e has positive entries, for we just choose cc small enough so 
that u e A e u e — C a is positive. 



u 6 A„ u £ 



e(l + e)(l + e 2 



1 'I e 
e 1, 
2 + e + e 2 
1 + e + 2e 2 



1 + e- 

2 + e 



2e 2 



In general, we define J (a, b) to be the d by d matrix whose diagonal is a, and 
remaining elements are b. We let A € = T(e, e 3 ), U e = J(l, e) and C e = J(a, a). 
With some work it can be verified that U e A e ^ 1 U e has all positive entries. □ 

Note that this gives an element of £2,2 that is not in P4. If it were, we could 
substitute X2 = 1)2 — 1, but xiyi + 1 is not in P2. 

Remark 12.37. We can construct polynomials of higher degree in S\i by mul- 
tiplying quadratics. For instance, ( 1 1 e J satisfies the conditions of Proposi- 
tion [1232] for < c < 1. Ifwe choose < Cj. < 1 and let e — > + , then the 
following polynomial is in 

Yl(x 2 +2xy +y 2 +2x+ Ci ) 



Chapter 



The analytic closure of 



P d is the closure of Pd under uniform convergence on compact domains. The 

uniform closure of P^ os is P pos d - Many results about these closures follow 
by taking limits of results about polynomials. For instance, we can conclude 
from Lemma [9.321 that if f (x,y) € P2 then f(x,x) 6 P2. Similarly we can also 
extend definitions from polynomials to these closures. As an example, we say 
that f , g e P d interlace if f + ag € P d for all real a. 

13.1 The analytic closure 

The properties of Pd are similar to those of P, except that we do not have a 
characterization of the members as we do in one variable. To start off, 

Lemma 13.1. Pd consists of analytic functions whose domain is C d . 

Proof. If f (x) is in P d then it is a limit of polynomials f n (x) in Pd . If we choose 
a € R d then f (x?) is the uniform limit of polynomials f n (x?) in P and hence 
f (x?) is analytic. Consequently, f is analytic in each variable separately, and so 
we can apply Hartog's Theorem [83J to conclude that f (x) is analytic. □ 

From SectionQXHlwe know that P e C Pajtf e < d, and P^ C Pp 5 2 . The 
most important non-polynomial function in P is e x . Thus e x is in all P d- There 
is another exponential function that is in Pd- 

Lemma 13.2. Ifd ^ 2 then e~* v is in P d . Also, e xy P d . 

Proof. From Corollary 1 1 1 . 71 we know that n — xy is in P2. Writing e~ xy as a 
limit 

e -*y = lim n- n (n-xy) n (13.1.1) 

n^oo 

it follows that all the factors of the product are in P2, and hence e~ xy is in P 2 . 
If e xv e P 2 then substituting x = y yields e" 2 e P which is false. □ 
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Since i(x,y) — e~ xy is in P pos 2 we can apply Lemma Til. 41 to find that 
f(x,x) = e~* 2 is in P, as we saw earlier. Since P c P pos d we see e~ x2 is in 
P poS d- We can combine these members of P poS d to conclude 

Proposition 13.3. If H is a d by d matrix with non-negative entries then e~ xHxt is 
inPv°* d . 

Proof. It suffices to show that clijXiXi is in P pos d for all non-negative ay. 
Since 

and all the factors of the product are in p pos d it follows that the product itself 
is in P pos d, and so is the left hand side. □ 

The non-negativity of the coefficients of H is important. For instance, if 
H = (-1) then e~ xHx = e x2 and this is not in P by 

Remark 13 A. We can easily construct functions in P2. If Y. |ajj < 00 then 
the infinite product riiliU + a i z ) * s an entire function. Suppose we choose 
sequences of positive real numbers {ai},{bi} so that Y. a i an d Y. D i are finite. 
The product 

00 

f(x,y) = JJtl + aix + biv) 
1 

is an entire function since 

Y_ kkx + b t y| < |x| Y_ l a tl + \v\ Y- ' bi ' < 00 

Since f is the limit of the partial products which are in P2, f (x,y) is in P2. 

Example 13.5. We will show that sin(x) + y cos(x) £ P2, which is an exam- 
ple of a member of P2 that is not a product. We start with the two product 
approximations to sin and cos: 



fn(x)=xf[ 1 



X 2 



k 2 7t 2 

k=l 

4x 2 



QnW 



n(i 

k=0 



(2k+l) 2 7t 2 



and notice that f n+1 < g n . Lemma H 1 .281 implies that f n+1 (x) + yg n (x) £ Pi- 
Taking the limit, we find that sin(x) + y cos(x) £ P2. 

The same limit argument used in Lemma 113.21 can be applied to Corol- 
lary 1^931 
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Lemma 13.6. If I € P pos and g e P d f/zen f(-3 x 3y)g e P d . 

This has a very important corollary: 

Corollary 13.7. Iff e P d f/ien e~ 9xi 9x2 f € P d . More generally, 
e -ax. 9y .p M _^p 2d . 



Proof. Since e xy G P2 we can apply limits to Lemma ["13.61 The second part 
follows from 

e -3x 3y _ g-axjByj _ g-3x d 9y d 



□ 



Corollary 13.8. The map x k 1-* L k (xu) maps P pos — ► P 2 . 
Proof. From 1 17.10. 3b the following diagram commutes 

x n • I u' Xl). 



* n (-y) r 



n.! xn-xy 

and consequently this commutes: 

ppos 



V 

ppos 



Lemma [l3.6l has another important consequence: 



□ 



Theorem 13.9. Iff — Y. a ij~x- x y^ in ^ nen ^ ne diagonal polynomials diag(f) = 
Y_ dux 1 and diagj(f) = Y_ a a i! x% are ' w P pos . 

Proof. Since f is in P| (n) the homogenization F = ^ aijxHj'z™ -1- ' is i n 
P3. The linear transformation g 1— * e~ d * dy g maps P2 to itself, and by Theo- 
rem [I0]2l] it extends to a linear transformation mapping P3 to itself. We com- 
pute 



-3v3 



k ' 3 \ k / 3 x k 



W,k \ / \ y 



j-k z n.— i-j 



(13.1.2) 
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Substituting x — y — gives a polynomial in P 

e- 9 * 3 «F| x=y=0 = Y_ a kk (-l) k k!z- 2k 

k 

The reverse of this last polynomial is in P, so Y a kk(— l) k k!z 2k is in P. Now 
all dkk are positive since f e P^, and so Y a kk(— l) k k!x k is in P. Replacing x 
by —x, and an application of the exponential map shows that Y a a xV is also 
in P. □ 

A similar proof establishes a similar result in higher dimensions, but we 
are not able to eliminate the negative factors. 

Lemma 13.10. If f[x,y,z) — Y ftj Mt^z' zs * n Pd+2(n.) then 
L(-D k k! f k k(x) and L(-l) k f klc (x) are in P d . 

Proof. We follow the proof above and apply the operator e~ d y dz , but we do 
not need to homogenize. For the second part, apply the first part to exp(f) 
where the exponential map operates on the y variable. □ 

Remark 13.11. Here is an alternative proof that the Hadamard product maps 
ppos x ppos — y ppos It does not ghow that the domain could be enlarged to 

P x P pos . 

If f,g S P pos thenf(x)g(u) e P p 2 ° s . Applying Theorem ESI shows that 
the diagonal of f(x)g(u) is in P pos . But the diagonal is Hadamard product: if 
f (x) = Y_ diX x , g(y) =Y_ biy v then the diagonal of Y CMbjxHj' is Y CkbiX 1 . 

Corollary 13.12. IfY ft My 1 an d Y QiMy 1 are both in Pd+i then 

^(-l) l f t (x) 8t (x) e P d 

Proof. The product (^fi(x)y 1 ) gi(x)z r ) is in Pd+2- Now apply 
Lemma[l3T0l □ 

This is false without the factor (— 1)\ Just take f = g; the sum Y fi( x )gi(x) 
is positive if the constant term of f is non-zero. We note some similar results: 

Corollary 13.13. If f = Y a ii xX V^ ' s i n tfT then for any non-negative integers r, s 
the polynomial 

(i + r)!(j + s)!^ 
r!s!i! 



/ a t+r,i+s rrr. x t 1 

£. — rich! 



Proof. Instead of substituting x = y — in l|13.1.2t take the coefficient of x r y s 
and apply a similar argument. □ 

Corollary 13.14. If~f = Y a U xX V^ zs zn P^fa) ^en define 

Sk(x) = Y_ Q y xl 

i-j=k 

For — n <k<nwe have that Sk € P, and Sk, Sk-i have a common interlacing. The 
map x k I— > Sk sends P pos (n) to itself. 
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Proof. For any positive a the polynomial g = (x k + ctx k_1 )f is in the closure of 
P2° s , and diag g = Sk + ctSk-i- By Proposition ll.35l sk and Sk-i have a common 
interlacing. If h = Y_ bky k is in P pos (n) then h(y) is in the closure of P2° s (n), 
and the following calculations show that T(h) is in P. 

diagh(y)f(x,y) = diag^ b k a ij x 1 -ij ,+k 

t,j,k 

= ^b k diag^a ij x i y» +k 

k y 

= ^b k Y_ ai i xl 

k i=j+k 

= ^b k s k = T(h) 

Corollary 13.15. For any n these polynomials have all real roots: 
^-(i,i) X ?■ i!(n-2i)! 



□ 



Proof. The two polynomials in question are the diagonal polynomials of a 
polynomial in P2, namely (x + y + l) n . □ 

Remark 13.16. Here's a variation on the even part (§ 1617.15^ . If f = Y. a i x ^ is m 
P Qlt then f (x + y ) is in P^ os . Now diag(x + y ) k is either if k is odd, or ( 2 ™)x m 
if k = 2m is even. Consequently, diag 1 f (x + y ) = Y. a 2V<- x is in P. 

In the remark we considered f ( x + y ) . If we consider f ( (x + l)(y + l) ) then 
we get a quite different transformation: 

Corollary 13.17. The linear transformation x r 1-* Y-i (t) 2 * 1 nwps P alt to itself. 

Proof. If f G P alt then f(xy) G P2, and so after substituting we have that 
f ( (x + 1) (y + 1) ) G P2. If we write f = £ a i x i then 

f((x + l)(y + l)) ^attx + lHy + l) 1 

i 

diagf((x + l)(y + l)) =X a ^Z (■) xi 



which establishes the result. □ 

In § IH1.16l we saw that the partial sums of the function e x in P do not have 
all real roots, but adding another factorial lands in P. 
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x ^ 

Lemma 13.18. Iff = Y. a i~ zs zn P then for any n the polynomial 

n / \ n 

LI ) cu x l = n! > — — - — — x 1 has all real roots. 

i=0 v 7 i=0 

Proof. The operator f (3 X ) maps P2 to itself, so consider 



x=0 

i/j 



= XQ i y-i('. L )DV 

i/j 



x=0 



,n-k 



Taking the reverse establishes the lemma. □ 

We can take limits of the Hadamard product to conclude from Theo- 
remEHZl 

Corollary 13.19. Iff G P^°~ s and g e P then EXP _1 f * g is in P. 

Here's a representation of P as the image of one exponential. 
Lemma 13.20. 

P = {T(e x ) I T: P pos — > P} (13.1.3) 

pv°* = {T(e x ) I T: P pos — ► P pos } (13.1.4) 

Proof. Since T: P pos — ► P extends to T: P^ — > P, we know that T(e x ) e P. 
Conversely, choose f € P and define 

T f (g) =f *' g = f *EXP _1 (g) 

We know that Tf : P pos — > P. Since Tf (e x ) = f , we have equality. The second 
case is similar. □ 

Unlike P, factors of elements in P2 aren't necessarily in P2. A simple ex- 
ample is x = (e xy ) (xe~ xy ). Note that xe~ xu € P2, yet e xy £ P2- However, we 
do have 

Lemma 13.21. If f (x,y )e~ xy € P2, fen f (x, a) and f (a,x) are in P for any choice 
of a e M. 

Proof. If f foyje - *" e P 2 , then f (x, a)e- ax e P. Since e ax e P it follows that 
f(x,a)eP. □ 
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13.2 Differential operators 

If we substitute derivatives for some variables, the result maps Pd to itself. 
This generalization of the corresponding property for f(D) acting on P has 
many important uses. 

Lemma 13.22. Iff (x) e P d then f (-3 X ) : Pd — ► Pd 

Proof. Since e~ 9x9 >' maps P2d to itself the lemma follows from the identity 



e- 9 « 9 yg(x)f(y) 



= f(-3 x )g(x) 

y=0 



By linearity we only need to check it for monomials: 



e -3x3y x I y J 



= — n— x ■ ( 9 yVy J 

y=o J- 



-3JV 



y=0 



□ 



We use the curious equality below to derive a corollary due to Sokal and 
Lieb ITT5I . 

Lemma 13.23. If f (x,y ) is any polynomial then 

f(x,y-3 x ) =e- 9xa «f(x,u) 
Proof. By linearity we may assume that f (x,y ) = x s y r . The left hand side is 



-l) x 3lx s 



1=0 VV 

= V 1 r-i s-j t-D 1 r! s! 

* i! (T-i)I(s-i)! 

= f tW x s y r 

i=0 

which is exactly the right hand side. □ 
Corollary 13.24. J/f(x,y) G P2d ften 

2. f(x,y-3 x ) G P 2d . 

2. f(x,-9 x ) G P d . 
Proof. If we iterate Lemma [13. 23| we get 

f(xi,yi - 3xi,. ■ .,x d/ y d - 3x d ) = e^ 9 ^ ■ ■ ■ e^^f (x,y) 

The first part now follows from Corollary 113.71 If we set y = we get the 
second part. □ 
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If we substitute dxt for Xi then the result is much easier to prove. 

Lemma 13.25. If f = ^f(xi,...,Xd-i)x|j is in P d and f(0) ^ then the linear 
transformation fic(xi, ■ • -/Xd-i)^Tr(g) maps P d to itself. 

Proof. The homogeneous part of T(g) is f (0)g H , so T satisfies the homogeneity 
condition. If we choose ai, . . . , Qd-i € R then 

T(g)(ai, . . . , a d _i,x d ) = f(ai, . . ., a d -i, 9 x Jg(ai, . .., a d _i,x d ) 

and this is in P since f(aj, . . ., a d -i,x) and g(ai, . . ., a d _i,x) are in P. □ 

Next is a Hadamard product result. 

Lemma 13.26. Zf £j fjMy 1 and ^ giMy 1 are in P2 d t/ien 

^J-DMf^x) g ,(x)eP d 

J 

Proof. A calculation shows: 

X fiM(-ay) 1 ) giMy 1 ) q = J_(-^Y^ fiM gxM 

□ 

Lemma 13.27. 7/Q = (q^ ) is a symmetric d by d matrix, then e~ xQxt g P d if and 
on/y ff aZZ elements o/Q are non-negtive. 

Proof. If all elements are non-negative then it is easy to see that e~ x Q xt G P d . 
Since a — xy e P 2 for a > 0, compute 

e- (ax2+2bxy+cy2| (3 x/ 3 y ) (a-xy) = a-xy +2b 
Since this is in F2, it follows that a + 2b ^ and so b is non-negative. □ 

13.3 Limits and transformations 

If f and g are entire functions and T is a linear transformation then we define 
f (T)g to be the formal series 

00 00 

J^OibjT 1 ^] (133 - 1) 

i=0 j=0 

where f = Y.'o' a i xX an d g = Y.'o bix\ This is only a formal sum since there 
is no apriori reason why f (T)g should exist. For a full discussion of the com- 
plexities when T = aD + xD 2 see II106II . 
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We are interested in the case where f, g € P and T: P — ► P. We assume 
that f n — * f and g n — > g where convergence is uniform convergence on com- 
pact subsets. The problem is then to determine when 

lim f n (T)g n exists and equals f (T)g. 

TI— »00 

The simplest case is when all limits involve polynomials. 
Lemma 13.28. If 

1. T decreases degree. 

2. h(T) : P — ► Pfor all he P. 

3. f eP 

then f[T): P — ► P. 

Proof. If f n — > g and g G P(tti) then both f n (T)g and f (T)g are polynomials 
since T k (g) = for k ^ m. Since the coefficients of f n converge to the co- 
efficients of f we see that f n (T)g — > f(T)g. By hypothesis all f n (T)g e P, so 
f(T)geP. □ 

Example 13.29. For example, since f (D) maps P to itself it follows that e D : 
P — ► P. Of course, this is trivial since e D f (x) = f(x + 1), but the same argu- 
ment applies to e~ 9xdl1 (see Corollary 1 13.71 1. 

In order to work with entire functions we recall Montel's theorem: 

Any locally bounded sequence of holomorphic functions fn de- 
fined on an open set D has a subsequence which converges uni- 
formly on compact subsets to a holomorphic function f . 

It follows that if T is an operator that takes bounded sequences to bounded 
sequences and if f n — ► f then Tf n — ► Tf . For instance, the k'th derivative is 
a bounded operator by Cauchy's integral formula: 



f (k) (a)~ 
2m 



f(z) A 



c lz- 



where C is a small circle centered at a. More generally, if f [x,y) — Y_ fiMy 1 
is a polynomial in two variables then the operator 

T:gi-> fi(x)D l g 

is a bounded operator since it is a sum of products of locally bounded func- 
tions fi times bounded operators. We therefore have 

Lemma 13.30. If T is a differential operator as above then limn^oo T(l — 2 n i ) n 
exists, equals T(e~ xy ), and is an entire function. 
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In order to prove that sequences are uniformly bounded we use the 
bounds on the size of a polynomial. 

Lemma 13.31. Suppose that T is a linear transformation and 

1. T* : P 2 — ► P 2 

2. T(l) G R\0andi(0) ^0. 

3. f u — > f in P 2 where f n € P 2 . 

tflCTlT.(fn) — »T,(f). 

Proof. Write f n = £ t F^My* and f = £ t F t (x)y\ Now Tf n = £ TfF^y 1 is 
in P 2 by hypothesis, so for any a G R we have (Tf n )(ct,y) G P and therefore 
we can apply Lemma [4. 161 



sup(Tf n )(a,y) < T(F n , )exp( r ^^—^ +2r z 



(TF n4 (a) ) 2 . ,TP n . 2 («| 
(TF n , )2 



+ r 



TfF 



n,0j 



If we define 



a =infT(F Tl/ o) 

n 

B r = sup T(F n , 1 (a)) 

n,| oc|^t 



A =supT(F n , ) 
C r = sup T(F n , 2 (a)) 

n,| a|^r 



then A, B r/ C r are finite and a is non-zero by hypothesis so 



B D~ A 

sup (Tf n )(a,y) < Aexpl r- +2r 2 ^- +r 2 - 



laK^lyl^T 

Thus T(f n ) is uniformly bounded and therefore converges to T(f ). 



□ 



13.4 Totally positive functions 

We raise some questions about totally positive functions and '■ 

Definition 13.32. A function f (x,y) is strictly totally positive if for every posi- 
tive integer n and sequences xi < x 2 < • • ■ < x n and yi < y 2 < ■ • • < y™ the 
determinant 

f(*i,yi) f(xi,y 2 ) 



f(*2,yi 



f(x n/ yi 



f(xi,y T 



f(x n ,y T 



(13.4.1) 



is positive. If the sequences and {y^} are restricted to positive values then 
we say that f (x,y ) is a strictly totally positive function on R 2 ^. 



CHAPTER 13. THE ANALYTIC CLOSURE OF P d 



457 



If we take n = 1 we see that a necessary condition for f (x,u) to be strictly 
totally positive on is that f [x,y ) is positive for x and y positive. A natural 
question (|106l l is to determine conditions on polynomials g(x,y ) so that 1/g is 
strictly totally positive on . 

We know three functions for which this holds. It is well known that e xy is 
strictly totally positive, and e xy is of the form 1/g where g = e~ xu S P2- The 
simplest function in Pj 08 is x + y; this is strictly totally positive on M. 2 ^ . That 
this is so follows from a famous evaluation. Recall Cauchy's double alternant: 

l^i<Kn 

n 

The numerator is positive for any pair of strictly increasing sequences; the 
denominator is positive since all xt and Uj are positive. 

For the last example, there is a quadratic analog (|13.4.2|l of Cauchy's for- 
mula due to Borchardt[159] where perm is the permanent. Since the perma- 
nent of a matrix with all positive entries is positive, it follows that (x + y)~ 2 is 
strictly totally positive on . 



det (-JLJ)- 



det (- = det ( — - — ] perm (— — ] (13.4.2) 

i<y<n \{Xi +yj) 2 J lmA^n \xi +yj J i^i, Krt \Xi +y } J 

If h(x) is any function that is positive for positive x, and f(x,y) is totally 
positive on then h(x)f [x,y) is strictly totally positive on M^_. This follows 
from simple properties of the determinant: 

det (h.(xi)f(xi,-yj)) = h(xj) • • • h(x n ) det (f(xi,Uj) > 

Consequently if g £ P2 satisfies the property that 1/g is strictly totally 
positive on then so does e~" 2 g. 

It is not true that 1/g is positive semi-definite if g € P^ 08 - see fT2l . 
We need a few properties of positive definite matrices |95[. 

Definition 13.33. Let A, B be Hermitian matrices. We write B -< A if the matrix 
A — B is positive definite. If A — B is positive semidefinite we write B ^< A. 
The relation ^< makes the set of all Hermitian matrices into a partially ordered 
set. 

Here are some properties of -< that we will use. 

1. A -< B implies det(A) < det(B). 

2. A -< B implies B" 1 -< A" 1 . 

3. A -< B and C positive definite implies A + C -< B + C. 
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4. A -< B and C positive definite implies CAC -< CBC. 

If f (x) is a positive definite matrix for all x in an interval on the real line, 
then we say that f is increasing if a < b implies f (a) -< f (b). f is decreasing if — f 
is increasing. 

Lemma 13.34. If A and B are positive definite matrices, and < xi < x 2 , < yi < 
y 2 then 

1 1 



|I + xiA + yiB| 
1 



\l + x 2 A+ Vl B\ 
1 



|I + xiA + -y 2 B| |I + x 2 A+y 2 B| 



>0 



Proof. If we let C = x 2 A, D = y 2 B, x = x\/x 2 , y = y\/y 2 then < x,y < 1 and 
C, D are positive definite. We need to show that 



1 



|I + xC + ijD| 
1 

|I+xC + D| 



\l + C+yD\ 
1 

|I+C + D| 



>0 



Since all the matrices are positive definite the determinants of positive linear 
combinations are positive. Thus we need to show that 1 

|I + xC + D||I + C + yD| > |I + xC + i)D||I + C + D| 
which is equivalent to 



II + xC 



II + xC + DI 



yD| < |I + C+yD| 



|I + C + D| 



If we denote the left hand side by F(x) then we will show that F(x) is an in- 
creasing function. 



I + xC + D is increasing 
(I + xC + D) _1 is decreasing 
D 1/2 (I + xC + D) _1 D 1/2 is decreasing 
(y - 1)D 1/2 (I + xC + D) _1 D 1/2 is increasing 
I + (y - 1)D 1/2 (I + xC + D) _1 D 1/2 is increasing 

The determinant of the last expression is F(x) - this follows from the identity 

|(aA + B)B 1 = |I + aA 1/2 B _1 A 1/2 | 

□ 



thanks to Han Engler for this argument. 
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13.5 Hermite polynomials 

We use generating functions to define Hermite polynomials in d variables. 
The main result is that they are in P d . 

Let S be a n x n Hermitian matrix, y = (yi, . . .,y n ) and x = (xi, . . .,x n ). 
The Hermite polynomials determined by S are defined to be the coefficients 
of the exponential generating function 



where I = (ii,. . .,i n ), I! = ii! ■ ■ -in!/ y 1 = v\ x ■ ■ -y]?, and z* is the conjugate 
transpose of z. The factor 2ySx* has a negative sign, so we can use Proposi- 
tion ll3.3l to conclude that f s 6 P2d- 

If we take n = 1 and S = (1) then the exponential is e~ yi ~ 2xiUl which is 
exactly the generating function of the one variable Hermite polynomials. 

If we multiply l|13.5.1t by exp( — xSx* ) then 



We use this to derive the Rodrigues' formula for the Hermite polynomials. 
The Taylor series of g(x) asserts 




(13.5.1) 





so if we choose g(x) = exp(— xSx*) then 




Hi(x) 



(-y) 



I! 



Equating coefficients of y yields the Rodrigues' formula 




If we rewrite this as 




then it follows that Hj(x) satisfies substitution. 
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In order to show that Hi is in P n it suffices to determine the homogeneous 
part of Hi, and we use the Rodrigues formula for that. We first note that if et 
is the coordinate vector with a 1 in the i-th place, and elsewhere then 

Q 

- — xSx* = (eiS + Se- )x. 

OXi 

Consequently, if we differentiate g(x) exp(— xSx 1 ) where g is some polynomial 
then 

g 

- — g(x) exp(— xSx*) = kfx) exp(— xSx t ) 

where k(x) is a polynomial, and moreover 

k(x) H = ((eiS + Sel)x)g(x) H . 

Upon writing 




exp(— xSx 1 ) = j(x) exp(— xSx 1 ) 



we conclude that the homogeneous part of ) is 

By assumption all coefficients of S are positive, so all the coefficients of j H 
are positive, and by the above it follows that the homogeneous part of Hi has 
all positive coefficients. This proves 

Theorem 13.35. If S is a symmetric matrix with all positive coefficients then the 
Hermite polynomials determined by 013.5. lb are in 

13.6 Hermite polynomials in two variables 

The results of this section exist simply to explain properties that can be ob- 
served in a particular graph of a Hermite polynomial in two variables. If we 
take Q = (\\) and I = (2,3) then 

H 2 , 3 = -16 (108 x - 207 x 3 + 54 x 5 + 114 y - 876 x 2 y + 432 x 4 y - 990 x y 2 
+ 1206 x 3 y 2 - 332 y 3 + 1420 x 2 y 3 + 744 xy 4 + 144 y 5 ) 

The homogeneous part factors into —32 (3 + x) 2 (2 + 3 x) 3 , which explains the 
asymptotes in Figure [T3. II The rest of the graph of ^3 is quite surprising: 

The following result shows that all the Hy are in P| en , but doesn't explain 
the striking regularity. 
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Figure 13.1: The graph of H 2 ,3 



Lemma 13.36. If S is not a diagonal matrix then Ry e P 2 • 

Proof. Let S = ( £ £ ) where b ^ 0, and set Q = ax 2 + 2bxy + cy 2 . Suppose 
that Hij has intersecting solution curves. If so, then Hy and ^rHy have a 
common zero. Now since 

— H- - — (e Q 3 1 3' e- Q ) 

= e Q 3i +1 3^ e-Q + Q x eQ a£ 9^ e« 
= Hi+ij + Q x Hi.j 

it follows that Hy and Ht+y have a common zero. Since e Q has no zeros, both 
d\ dy e Q and 3 X +1 3 y e Q have a common root. Since 3 X d y e Q < 3 X +1 d y e Q , we 
conclude from Corollary 19.421 that 3 X _1 3 y and 3 X 3 y e<3 have a common 
root. Continuing, we see 3 y e Q and 3 X 3 y e Q have a common root. Eliminating 
derivatives of y, we conclude that 3 y e Q and 3 X d y e Q have a common root. 
Because Hio = — 2bx — 2cy, we can solve for x and substitute into 

H u =2 (-b + 2abx 2 + 2b 2 xy + 2 acxu + 2b cy 2 ) 

which yields —2b. Since b ^ 0, there are no common roots. □ 

We follow Nunemacher in finding the exact equations of the asymptotes. 

Let 



Q = Q x 2 + 2bxy + cy 2 



H, 



Hu,m+1 
Hn+l/m 



_~ax 2 +2bxy + cy 2 



3 

3xy \dy 
-2(bx + cy)H n , m + 3 y H n/m 
-2(ax + by)H n , m + 3 x H n<m 



e -(ax 2 +2bxv + cij 2 
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From the recursions it easily follows that the homogeneous part of H n m is 
(— 2) rl+m (ax + by) n (bx + cy) m . Consequently, there are n asymptotes in one 
direction, and m in another. We proceed to find the precise equations of these 
asymptotes. It's easy to see that we can write 

H n , m = (ax + by) n Q n , m (x,y) + (ax + by) n - 2 Q n _ 2 , m (x,u) + • • • 

H n ,m+i = (ax + bu) n Q n ,m+i(x,y) + (ax + by) n ~ 2 Q n _ 2 ,m+i(x,u) H 

= -2(bx + cy) [(ax + by) n Q n , m (*,y) + (ax + by) n - 2 Q n _ 2 , m (x,y) + • • • ] 
+ 3„ [(ax + by) n Q n , m (x,y) + (ax + by) n - 2 Q n _ 2 , m (*,y) + • • • ] 

Now when we substitute x = b,y — — a, all terms with a factor of (ax + by) 
vanish, so equating terms of like degree yields 

Qn,m+i(b,-a) = -2(bx + cy)Q n , m (b,-a) 
Qn-2,m+i(b,-a) = -2(bx + cy)Q n _ 2 , m (b,-a) 

Thus we see that 

Qn-2r,m(b,-a) = (-2(bx + cy ) ) m Q n -2r,o (b, -a) 

The asymptotes B132I are ax + by = a where a is the root of 

t n Q n ,m(b, -a) + t n - 2 Q n _ 2 , m (b, -a) + • • • 
= (-2(bx + cy )) m [t n Q n , (b, -a) + t n - 2 Q n _ 2 , (b, -a) + • • • ] 

and so the asymptotes of H n/m . and H n/ o are the same. But, 

H n , (x,y) =-e Qx2+2b ^ (J^Y e - {a ^ +2h ^ ) 

consists of parallel lines, so we can substitute y = to see that the asymptotes 
ax + by = a to H n/in satisfy H n; o(a, 0) = 0. Since 

H n ,o(x,0) =e 

is the Hermite polynomial H n (x/ yfa) we conclude: 
Proposition 13.37. The asymptotes o/H^ m are of the form 

ax + by = a bx + cy = (3 
where H n (oc/^/a) — and H m {fi/^/c) —0. 
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The coefficients of some Hermite polynomials obey a recursion similar to 
the three term recursion for orthogonal polynomials. Choose a positive sym- 
metric matrix S, fix a positive integer n, and define 



H n M=e* s *' (^-) e- xS ' 



= ^ft(x 1/ ...,x d _i)x 1 d 



i=0 

If we let v = (vi, . . . , v d ) be the first row of S, then it easy to see that 

H n +i = -2(vixi + h v d x d )H n - 2nv! H n _i 

We can use this recursion to prove by induction on n that 



fk = ( Y_ — r — Xi ) fk+i - ( ' 



n + l)(n + 2) vA 
2(n-k) v^J fk+2 

If d = 2, and S = ( £ b c ) then 

/k+1 q\ /(k+lKk + 2) a\ 

fk(x) = ln-k b J Xfk+l(x) - { 2(n-k) J fk+2(x) 

We get a three term recursion if we consider H n i instead of H n o, but the 
coefficients of H ni2 have a five term recursion that includes x 2 f n+2 and xf n+3 . 



Chapter 



Extending P d 

The goal in this Chapter is to investigate polynomials that only satisfy substi- 
tution for some values of y. There are several different approaches. 

14.1 Taking the image of P2 

We define the set ^2 of polynomials as images of polynomials in P2. 
Lemma [13.201 provides the motivation for our analogues, as it shows that we 
can define P in terms of linear transformations. Recall its results: 

p ={T(e x ) I T: P pos — ► P} 
ppos _ {T(e") I T - p pos > pp° s } 

We now consider two variables. 
Definition 14.1. 

<p 2 = {T* (f) I f e P^° S AT: P pos — >PAT satisfies induction} 
<p pos = {T, (f ) I f e PP° S AT: P pos — ► P pos A T satisfies induction} 

= closure of CP2 
<p pos = closure of CP2 

The following lemma enumerates basic properties of < $2- 
Lemma 14.2. Suppose g — T*f is in ^2- Then 

1. P P 2 0S C <p POS C <p 2 £ P u . 

2. g H (x,l) e P 

3. g(x,cc] e P vos for<x^ 0. 

464 
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4. |a e <p 2 



6. The coefficient of any power ofy is in P v ° s . 

7. Ifh{x) € P pos (n) the the homogenization H = y n h(x/y) is in ^2- 

8. IfS: P pos — > P pos then S* : <p 2 — * ^2- 

if we assume that g e 'Pj 08 ffaen parfs (4), (5), and (8) Wrf with ^2 replaced by 
«P POS and m (2) r<;e toce that g H (x, 1) e P pos . 

Proof. If we take T to be the identity then P? C <p 2 - The example below 
shows that this is a proper containment. Since g H = T*(f H ) we see g H (x, 1) e 
P. If a > then f(x, a) e P pos , and so g(x, a) = T*f(x, a) e P pos . If we define 
S(h) = £ T(h) then S: P pos — » P pos , and so |f e Next, |^ = T.(f*), 

and so e ^2 since |g S Pj 08 . Since we can differentiate with respect to y, 
and substitute y = 0, we see that all coefficients of powers of y are in p pos . 
The coefficients of H are all positive, and substitution is clear. The last one is 
immediate from the definition: S*(T*f) = (ST)*f. 

□ 

Example 14.3. Members of *p pos do not necessarily satisfy substitution for 
negative a. Consider T: x n 1— > (x) n and f = (x+y + 1) 2 € ¥^ s . If g(x,y) = T* f 
then g e ^2, yet g(x, — 2) = x 2 — x + 1 has complex roots. 



Constructing elements in ^2 

We can construct elements of ^2 by explicitly exhibiting them in the form T*f 
as in the definition, or by a general construction. 

Example 14.4. The generalized Laguerre polynomials Ln.(x;y) are in <p pos . 
This is a consequence of the surprising identity ( |14.1.H . If we define T: x n 1— > 
(x) n then 

V 1 ( x + y + 1 )^ = n!L n (x;y) (14.1.1) 

Now T- 1 maps P pos to itself and ( x + y + 1 )^ e P pos . It follows from (114.1.1b 
that L n (x;y) S '^ os . I don't know a good proof for 1 114.1.11 ; there is an unin- 
teresting inductive proof. 

Example 14.5. We claim xy — 1 e ^2. First of all, the transformation T : x n 1— > 
H n (x/2) maps P to itself, and satisfies T(l) = l,T(x) = x,T(x 2 ) =x 2 — 2. Now 
choose a positive integer m, and define S(f ) = T m (f ). Then S : P — ► P and 

S(l)=l S(x)=x S(x 2 )=x 2 -2m 

Set e = l/(2m), and define 

f m = S*(x+ ey)(y + ex) 

= xy(l + e 2 ) + ey 2 + ex 2 - 1 



CHAPTER 14. EXTENDING P d 



466 



then 

lim f m = xy — 1 

m — >oo 

A similar argument shows that xy — a G ^2 for any positive a. Notice that 
we are using a limit of transformations to show that an element is in < $2, not a 
sequence of elements. 

Example 14.6. The map T: x i h-> (x) i satisfies P pos — ► P pos . If f(x) e P pos , 
then the homogenization of f(x) is in P2. Consequently, if we define S(x x ) — 
(x)iy n_i thenS(f) e <# v 2 os . 

Example 14.7. We can use transformations to get elements of ^2 or <p pos 
from elements in p pos . Suppose that T is a linear transformation satisfying 
T: P pos — > ppos^ anc j S is a linear transformation then the composition be- 
low gives elements of ^2 from elements of P pos . 

ppos -A^p 2 _ 

Useful choices of S are 

f(x) 1 * f(x + y) 

f (x) 1— > homogenization of f(x) 

In the latter case the composition is x x 1— > T[x l )y n ~ x where n is the degree of 
f. 

The graphs of polynomials in ^2 

The graphs of polynomials in CP2 resemble polynomials in P2 in the upper 
left quadrant, since they satisfy substitution for positive y, and their homoge- 
neous part has all negative roots. Here are two examples. Figure [12.11 shows 
the graph of Ls(x;y). In general, the Laguerre polynomials L n (x;y) have a 
series expansion l|7.10.2t that shows that V n {x;y) is a polynomial of degree n 
in both x and y, with all positive terms. The leading term of the coefficient of 
x is 

(n-k)!k! 

and hence the homogeneous part of L n (x;y) is ^j(x + y} n . Consequently, the 
solution curves are all asymptotic to lines with slope — 1. Also, L n (0;y) — 
Vn / i m P nes ma t the graph of L n (x;y) meets the y-axis in —1, —2, . . ., — n. 
All lines y — ax where a is positive will meet the curve in negative y values, 
so L n (x; ax) € pP° s fp 2 j s closed under differentiation with respect to y, as 
can also be seen in Figure ll2.il 

For the next example, we take T: x n 1— > (x) n / and pick (at random) 



f = (x + y + l)(x + 2y +2){x + 3y + l)(x + 5y +2). 
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Figure 14.1: The graph of T*f 



Figure fl4 . 1 1 shows the graph of T*f. In Figure IT2. II the solution curves all meet 
the y axis, but this is not true in Figure 114.11 Also, the line y — x does not 
intersect all the solution curves, and so f (x, x) ^ P. 

If we pick our slopes appropriately, then we can intersect all solution 
curves. 

Lemma 14.8. Suppose that f(x,y) e < ^ os , and that <x is the largest root o/f H . If 
> > a then f(x, (3x) e P pos . 

Proof. The line y — (3x meets the x-axis to the right of intersection points of 
the solution curves, and is eventually less than every solution curve, so it must 
meet every one. □ 

A partial Polya-Schur result 

Next, we look at two actions on ^2 by polynomials. 
Lemma 14.9. Suppose that ge^ and he P pos . Then 

1. h(y)g(x,y) e <P 2 . 

2. h(£)g(x,y) e*p 2 . 

3. h(4)g(x,u)e<p 2 . 

If g S ?P2° S t/iew flW t/ie statements are true with <$2 replaced by y$2° s . 

Proof. Assume that g = T*f as in the definition. If h(y) e pp° s ; then h(y) e 

P~p°\ so h(y )f (x,y ) e pP°* 2 - The first part now follows from 

T„h(y)f(x,y) = h(y)TJ (x,y) = h(y)g(x,y) 
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For the second part, it suffices to show that g + ag ' £ ^2 if a ^ 0. This follows 
from the fact that k k + ode' maps P pos to itself. To verify the third part, if 
suffices to show that g + ag y £ ^3 2 for positive a. But T* (f + od y ) = g + ag y 
since T* commutes with and f + cxf y £F 2 . □ 

We can use this result to go from an element of ^2 to a transformation. 

Proposition 14.10. I/T„(e x «) e ^ os then T: pV ° s — ► pV ° S - In other words > 
suppose g{x,y) = Y.9iM^ e V2° S - The linear transformation T: x 1 g^ 
defines a map P vos — > P pos .' 

Proof. If we choose h(x) = aix 1 £ P pos then The coefficient of y n in 

h rev (4)g(x,y)is 

^a i9 i(x) =T(h) 

Since h rev (^)g(x,u) £ $ 2 , we know that all coefficients are in P pos . □ 

14.2 Polynomials in several variables with positive 
coefficients 

In this section we look at the set Q d e of certain polynomials in d + e variables 
that have non-negative coefficients. The motivation for this set comes from 
considering these three properties of P 2 . Assume that f (x,y ) £ P 2 . 

1. f (x, a) £ P for all oc. 

2. f(a,x) e P for all a. 

3. f (x, ax) £ P for all positive a. 

Qd,e is constructed by starting with P^° s and adding e new variables so that 
the resulting polynomials satisfy certain substitution conditions. We next de- 
fine interlacing, and verify that it obeys the expected properties. The graphs 
of polynomials in Qo,2 are more complicated than those in but their be- 
havior in each quadrant is explained by their defining conditions. We find 
some general constructions for polynomials in Qd, e that are not necessarily in 
^d+e- W e then look at a special class of polynomials in Qd, e - those that are 
linear in the last e variables. They can be identified with collections of poly- 
nomials that are defined on the corners of a cube. The three variable case of 
these polynomials leads to matrices that preserve interlacing. 

Definitions 

In order to better motivate our definition, we begin by restating the definition 
for interlacing mentioned above. Let F(x,y) = f(x) + yg(x). If F(x, a) and 
F(x, ax) are in P pos for all non-negative cc then f and g interlace. This suggests 
our first definition: 
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Definition 14.11. is the set of all polynomials F in x, y with non-negative 
coefficients that satisfy the two conditions 



F(x, a) € P pos for all non-negative a 
F(x, ax) € pP° s for all non-negative a 

The general definition extends polynomials in P^° s rather than poly- 
nomials in p pos . We let x = (xi, ...,x d ), y = {yi, ■ ■ ■ ,y e ), and y£ = 
(cxix^,...,OeX^). 

Definition 14.12. Q d/e is the set of all polynomials f (x, y) in d+e variables with 
non-negative coefficients such that for all non-negative oti, . . . , <x e , all bt which 
satisfy bi G {1,2, . . ., d} and all C\,...,c e which are or 1 we have thatf (x, y? ) G 

ppos 
1 d ' 

In other words, if we substitute either a non-negative constant or a non- 
negative constant multiple of some x variable for each y variable then the 
resulting polynomial is in P^ os . If d is zero, then we substitute either a non- 
negative constant or a non-negative multiple of x for each y variable. 

Notice that the conditions for a polynomial to be in Qi 2 are a subset of 
those that define Q03. Consequently, Q ,3 C Qi,2- More generally we have 

Lemma 14.13. Ifd^l and e ^ then Q d , e c Q d -i, e +i and Qo, e +i C Qi, e . 

Proof. Since a polynomial in Qo, e +i satisfies all the conditions of Qj e plus a 
few more involving substitutions for the first variable, we have Qo, e +i C Qi, e - 
If d ^ 2 then choose f € Q d , e and consider the substitution conditions that 
must be met for f to be in Q d -i, e +i: 

f(x 1 ,...,x d _ 1 ,ai,a 2 x^,...,a e+1 x^ 1 i ) e P^ (14.2.1) 
f(xi, ...,x d -i, aiXb 1 ,a 2 x^ 2 2 ,...,a e+ ix5i 1 e e + 1 i ) e P^ (14.2.2) 

( 114.2.1b holds since 

f (xi, . . . , x d _i, x d , a 2 x£ 2 2 , . . . , oce+ix^ ) e P d ° s 

and substitution of a constant for x d gives a polynomial in P^ ^- 1)14.2.61 holds 
because we can substitute any positive multiple of a variable for a variable in 
P^ os , and the result is in P^ os 1 . □ 

Q d , e is n °t empty since P^°! e C Q d , e - We allow some coefficients to be zero, 
so Q d/e C Q d , e +i, and therefore we have the containments 

P pos = Quo = Q ,i 

c c 

Pf s = Q 2 ,o c Q u D Q ,2 

c c c 

P pos = q 3/0 c Q 2 ,i c Qi, 2 d Qo, 3 
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Example 14.14. If we choose ft < gi G P pos then it follows from the results in 
the next section that 

n 

i=l 

The polynomial f(x,y) -=x + u 2 is in Q14 but is not in Qo,2 since f (l,y) ^ P. 
The polynomial 

f = 2x + xu + (l + 3x)z + uz (14.2.3) 

is in Q12 but is not in Qo,3- To verify this we just make the four substitutions 
that define Qi,2, and check that in each case the polynomials have all real roots. 
It is not in Q03 since f (.01, x, .Olx) has imaginary roots. 

Example 14.15. Suppose that f < g where f , g G P pos have positive leading 
coefficients. We know that (xg, f, g) is totally interlacing. If oc is positive then 
the matrix below on the left is totally non-negative 

xg + od 
f + ag 




and hence 
If we define 



xg + ctf < — f + ag (14.2.4) 



h(x,y,z) = xg + zf + y(f + zg) (14.2.5) 

then (|14.2.4l l shows that fi(x,y, a) G Qi,i for positive a. Also, replacing a by 
I/a and multiplying by oc. yields axg + f < — od + g. Multiplying the right by 
x shows that 

xg + (ccx)f < — f + (<xx)g 
which implies h(x,y,ax) e Q14. Consequently, h[x,y,z) S Qi,2- 

Elementary properties 

We define interlacing, and then show that Qd, e satisfies the expected closure 
properties. If we consider the definition of Q\,\, we see that we can define 
interlacing in P pos as follows: 

f < g in P pos if and only if deg(f) > deg(g) and 
f + yg S Q14. 

If f, g e Qd, e then we say that f < g if and only if the total x-degree of f is 
greater than that of g, and f + y e +ig G Qd,e+i- All this means is that f < g if 
and only if 

total x-degree (f ) > total x-degree(g) 
f(x,y) + cxg(x,y) e Q d , e 
f (x,y) + axig(x,y) G Qd, e (for any 1 ^ i ^ d) 

We use < to define < — : 
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f < — g iff we can write g = af + h where a ^ and f < h. 

In order to show that f < g all we need to do is to show that all substitutions 
interlace. 

Lemma 14.16. Suppose that f,g e Qd,e, and that the x-degree off is greater than 
the x-degree of g. The following are equivalent 

1. f(x,yj) < g(x,y£) in F p ^ for all appropriate a and b 

2. f(x,y) +u e+ ig(x,y) G Qd,e+i 

3. f(x,y) < g(x,y) in Q d , e . 

Proof. The second and third are equivalent since they are the definition of 
interlacing. To show that (3) implies (1) note that interlacing implies that 
f+He+ig G Qd,e+i- We can substitute for y e +i and conclude that 



and the conclusion follows from Lemma flO.381 That (1) implies (3) is similar. 



Lemma 14.17. Assume that f, g,h G Qd,e- 
2. fg G Q d , e - 

2. Iff < — gthenhfi — Kg. 

3. Iff < — g and f < — h then g + h G Qd, e and f < — g + h. 

4. If g < — f and h < — f then g + h € Qd, e g + h < — f. 

5. If oi\, . . . , aa and |3i, . . . , (3 e are non-negative then 
f(a 1 x 1 ,...,a d x d , p 1 y 1 ,...,p e y e ) e Q d , e - 

Proof. We first consider the case where all "< — " are "<". The first one is 
immediate from the observation that fg(x,yj) = f (x, yj) g(x, yj) The second 
follows from the first since hf+Ue+ihg = h(f +u e +ig), and f +u e +ig G Qd,e+i 
by definition of interlacing. 

For assertion (3), in order to verify that f + u e +i(g + h) G Qd,e+i we use the 
properties of interlacing in P^ os . Since f < g, we know that by Lemma [l4.16l 



f(x,yj) + ag(x,yj) G 
f(x,yS) + axtg(x,y£) G 



ipos 
d 

,pos 
d 



□ 



f(x,yb)<g(x,y£) 
f(x,yS)<K(x,yS) 
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and adding these interlacings together gives 

f(x,5t)<g(x,yg)+H(x,y^) 

It follows that f + y e +i (g + h) e Qd,e+i- The proof of the next statement is 
similar. The last statement is immediate from the definition. 

In the general case f < — g we use the representation g = ocf + k where 
a ^ and f < h. For instance, to establish (2), we know that f <k implies 
hf <J fik, and therefore hg = ochf + hk implies hi < — fig. □ 

Factors of polynomials in Qd, e are generally in Qa,e- 

Lemma 14.18. If fg € Qi, e/ and f has only non-negative coefficients then f e Qi, e . 
If fg € Qd,e/ f Zms onZy non-negative coefficients and f(x,yf) e Y p ^ s for all 
appropriate y£ fen f e Q d , e . 

Proof. If some substitution resulted in a complex root for f , then it would be 
a complex root for f g as well. Since f has non-negative coefficients, f is in 
Qd, e . □ 



Constructing polynomials in Qi 2 

We can easily construct polynomials in Q12 using products. If f < g, h <J k are 
polynomials in P pos then f+ug e Qi t z, h+zk G Qi,2, and therefore (f+yg)(h+ 
zk) e Qi ; 2- However, these polynomials are also in P^. We do not know if 
Qi,2 is different from P p ° s 43. 

Lemma 14.19. Suppose that f, g, h, k e P pos /ja^e aZZ positive coefficients. If 
9 

^ X .. . 

f k w/iere «ZZ interlacings are "<" or all are "<" and the determinant | ^ £ | 

H 

Zzas no negative roots then 

f + yg + zh + yzke Q 1/2 

Proof. It suffices to show that f + ag < k + ah and f + axg < k + txxh for all 
positive a. The interlacing assumptions imply that all four linear combina- 
tions are in P pos . We now follow the proof of Lemma |l.52| to show that the 
first interlacing holds since the determinant in the conclusion is never zero for 
negative x. Again by following the proof of Lemma 11.521 the second holds 

™I£3I=*Ik{[I- d 

Note that Example 114.151 is a special case of the Lemma. Take 

.f 



x g g . Since 



xg f 

g f 



xg — f clearly has no negative roots, the 



Lemma applies. 

It is easy to find polynomials in Q12 using Lemma [14.191 This is a special 
case of a more general construction in the next section. 
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Lemma 14.20. Choose w(x,y, z) e Pg 05 , and write 

w[x,y,z) =f(x) + g(x)y + h(x)z + k(x)yz H 

Then f (x) + g(x)y + h(x)z + k(x)yz G Qi, 2 . 

Proof, f, g, h, k are all in P 2 , and Theorem 19.1131 shows that the determinant is 
never zero. □ 

For instance, if v(x) £ pP os / then the Taylor series expansion of v(x + y + z) 
shows that 

v(x) + v'(x) ■ (y + z) + — ^yz e Q u 
Polynomials linear in the y variables 

An interesting class of polynomials in Q^ e consists of those polynomials that 
are linear in the y variables. In general, if f is a polynomial in Q& je that is linear 
inyi,.. .,yd then we can represent f by labeling the vertices of a d-cube with 
appropriate coefficients of f . 

We can find such polynomials in Qa,e by taking initial terms of polynomi- 
als in P^ e . 

Lemma 14.21. Suppose f(x,y) e Pd°+ e / and let g(x,y) consist of all terms ofi that 
have degree at most one in every y variable. Then g g Qa,e- 

Proof. For simplicity we illustrate the proof with e = 2; the general proof is 
the same. If we write 

f(x,y,z) = f 00 (x) +fi (x)y + f i(x)z + fn(x)yz H 

g(x,y,z) =fooM +fioMy + f 01 (x)z + f u (x)yz 

then the coefficient of y z in (y + x\ a) ( z + x ■ |3 ) f (x, y , z) is 

g(x,x[a,x, s |3) =f oM + fio(x)x[a + f i(x)x, s |3 + fn(x)x t r x?a(3 
Consequently, we see that since <x, |3 are positive, 

g(x, a, |3), g(x,Xia, |3), g(x, oc,Xj(3),g(x,Xia,Xj|3) 
are all in P^ os , and so g(x,y,z) e Qd, e - □ 

Lemma 14.22. Suppose f(x,y) 6 Qi, e and has x degree r and yt degree si. The 
reverse polynomial below is in Qi /e : 
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Proof. Any substitution for yt by a constant yields a polynomial in some Qi /e < 
where e' < e. We can also scale each yi, so we only need to consider the 
substitution yi — > x. Thus we need to verify that 

x i-+«H--+«.f( _) e ppos (14.2.6) 

X X 

If a monomial in f (x,y) is a^y 1 with total degree i + |I|, then in 1 114.2. 6| | the 
degree is u — (i + |I|) where n = r + si + • • ■ + s e . Consequently, J14.2.6I I is the 
usual reverse of f (x,x, . . .,x) which we know to be in P pos . □ 

For instance, if d = 1 then Lemma [l4.22l shows that if 

fu(x) +y f 2 j(x) +zfi 2 (x) + yzf 22 (x) € Qi /2 

then the reverse is also in Qi 2 

uzfn(x) +zf 21 (x) + yf 12 (x) + f 22 (x) e Qx,2 
Matrices preserving interlacing 

Matrices determined by polynomials linear in y and z are closed under mul- 
tiplication. We can use this fact to show that these matrices also preserve 
interlacing. This is a special case of Theorem l24.11l 

Lemma 14.23. Suppose that f, g e Q42 are linear in y and z and write 

f = f 11 + yfi2 + zf 2 i + yzf 22 
9 = 9u +ygi2 + zg 2 i +yzg 22 

Represent f, g by matrices in the following way 
p _ rf 12 f22 \ 

\fll f21 / 

\9n 921/ 

If the product is FG = 
t/ie« hn + yhi 2 + zh 2 i + yzh 22 € Q d/2 



/hi 2 h 22 \ 
VHn h 2 i / 
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Proof. The proof is an application of the Leibnitz rule in disguise. We know 
that 

fn + yfu<hi +yf 22 
gn + zg2i < gi2 + zg22 

and multiplying by the right hand sides 

(fn +yfi 2 )(gi2 + zg 2 2) < [hi + yf22)(gi2 + zg 22 ) 
[hi +yf22)(gn +zg2i) < [hi + yf 22 )(gi 2 + zg 22 ) 

and adding them together yields 

(fu +yhi) (912 + 2922) + [hi +yf22)(gu + zg 2 i) <(f 2 i +yf22)(gi2 + zg22) 

The matrix representation of the left hand polynomial is exactly FG. 



□ 

Corollary 14.24. Iff e Qi /2 is given as in the lemma, gi < — g 2 and we set 

(hi hi\ ( Qi\ /%\ 

\hi hi) \9iJ \^ij 

then hi < — h 2 . 

Proof. We check that g = (gi + ug 2 )(l + z) is in Qi ;2 . Since gi < — g 2 we know 
that gi + yg 2 e Q14. Multiplying by 1 + z shows g G Qi, 2 . 

The matrix of g is ( g] gj ) . Applying the lemma and considering only the 
first column of the product gives the conclusion. □ 

We can use the determinant condition of Lemma [l4.19l to find such matri- 
ces. 

Corollary 14.25. Suppose that f, g, h, k g pv° s have all positive coefficients. If 

g 

S V .. . 

f k where all interlacings are < and the determinant | £ £ | has no negative 

H 

roots then the matrix (f J) preserves interlacing for polynomials inP pos . 

Example 14.26. The determinant of the polynomials in the interlacing square 
below has only positive roots, so the corresponding matrix preserves interlac- 
ing. If we multiply this matrix by ( ? n ) which preserves interlacing, we find 
that the matrix of polynomials below preserves interlacing for polynomials in 

ppos 
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4 + 88x ^->- 8 + 260x + 216x 2 

32 + 64x ^ 64 + 416x 

32x + 64x 2 64x + 416x 2 
4 + 88x 8 + 260x + 216x 2 

We can not give a characterization of the matrices that preserve interlacing, 
but we can find a few restrictions. We begin with the linear case. 

Lemma 14.27. Suppose f, g e Qi^ have the property that for all p < — q e Qi,2 it 
holds that fp + gq e Qi,2- Then g < — f. 

Proof. If we choose p = ctq for positive a then p < q, and so p(af + g) e Qj 2- 
Since factors of polynomials in Qi 2 are in Q12, we conclude that <xf + g S Qi,2- 
Next, if we choose p = axq then again p < q, and so xf + g e Q. It follows that 
g<f. ' □ 

Lemma 14.28. Suppose that 

fi2 h2\ ( §t\ = /% 
fn hi) \gi) yni 

and the ftj have the property that whenever gi < g2 in Qi ; 2 then it holds that hi < h2 
in Qi,2- Then, the fij are in Qi,2 and form an interlacing square 
f 12 in 

\ \ 

fll ^21 

Proof. The horizontal arrows follow from the previous lemma. If we take a 
limit, we may assume that gi is zero. Thus, we have that fng2 <fi2g2 which 
implies that f n <.f 12 and that fn, fj2 are in Q12. Taking g2 to be zero gives the 
properties of the second column. □ 



14.3 Epsilon substitution 

We have looked at properties of polynomials f (x,y) where f (x, a] is guaran- 
teed to be in P for only positive a. We now reduce the range of acceptable 
a. 

Definition 14.29. ^ consists of those polynomials f (x;tjn . . .,Ud) with the 
property that there is an e, depending on f , such that 

I oca I < e,...,|ctdl < e => f(x;ai,...,«d) 6 P 

It's easy to tell from the graph if a polynomial is in . For instance, Fig- 
ure [143] is the graph of a polynomial of degree 12. We can see that every hor- 
izontal line y — <x intersects the graph in 12 points if |ct| is sufficiently small, 
and consequently f 6 ^ . 

Here are a few simple facts that need no proof. 
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1. Iff,ge^thenfge^. 

2. Iffe^thenfle^. 

3. Iff €^athenf(x;0,...,0) eP. 
Let's consider two examples. 

Example 14.30. We will show that Z[Lo p iM(i)y n_i € P i where Pi is the 



Legendre polynomial. We make use of the formula 111681 
£_ ( 1 j V )Pi(x)y n - i = (l + 2xu+u 2 ) n / 2 P Tt ((x + y)(l+2xy+y 2 )- 1 / 2 ) (14.3.1) 

We show that we can take e = 1. If P n (P) = 0, then we know that |(3| < 1. 
Solving 

|3 = (x + y)(l+2xy+y 2 )- 1/2 

for x yields 

x = -y+y |3 2 - v / |3 2 -u 2 |3 2 +u 2 p 4 

Now if |y| < 1, then |x| < 1. In this case not only is 014.3. lb in P\, but we have 
the stronger result that 

\oc\^l =► ^PilxlMa^eP 1 - 1 ' 1 ' 

i=0 ^ ly ^ 

Example 14.31. Define the polynomials of degree n 

fn(*,u) = ^(x 2 + U 2 -ir 

Figure [14. 2l is the graph of f(,. We have the relationship 

Ux,y)=0iffw(- ? =L=)=0 

Note that h is, up a factor, the Legendre polynomial, and so has the same 
roots. Since fn(0,y) = h. n (y) we see that the intersection of the graph of f n 
with the y axis is at the roots of the Legendre polynomial. As n — > oo, these 
roots become dense in (—1,1). 

The relationship between f n and h n shows that the graph of f n consists of 
ovals through the roots of h n on the y axis that all meet at (1,0) and (—1,0). 
Consequently, any horizontal line close enough to the x-axis will meet the 
graph in n points, and so f n G 5\ . 

Since the smallest positive root of the Legendre polynomial becomes ar- 
bitrarily small, we see that the e for which the interval — e ^ a ^ e implies 
f n (x, a) G P becomes arbitrarily small. None the less, all f n are in P\. 
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In the case d = 1, this last observation says that the constant term (with 
respect to y) of a polynomial in IPf is in P. If all derivatives with respect to y 
are in then all the coefficients are in P. 

We now show that 2-variable analogs (114.3.21 of the Legendre polynomials 
are in &\, and consequently all their coefficients are in P. Figure [14.31 is the 
graph of p 6 ,6,i2- 



PTV,m,r(x,y) = 




(14.3.2) 




Since differentiation with respect to x preserves we may assume that 
n = 0. We need to consider how differentiation with respect to y affects the 
graph of po,m,r- In Figure [1441 we have the graph of po,6,6 and Vo,7,6- The light 
lines are in po,6,6 (see Figure \14.2} and the dark lines are in po,7,6- It is easy to 
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see that 



the x degree of po, 




2r — n — 1 n odd 



2r-u 



n even 




Figure 14.4: The graph of 3^ (x 2 + y 2 — l) 6 = and its y derivative 



We determine a point (a, b) in Po,7,6 by considering the line x = a, and 
finding the intersection of the line with the graph. If —1 < a < 1 then there 
are 6 intersection points. If (a,b) is in the graph of Po,7,6 then b is a root of 
-^Vo,6,(>{ a rV)r and so there are five possible b's, and they interlace the six in- 
tersection points. 

Thus, the graph consists of five curves joining (—1,0) and (1,0), each one 
passing between a different pair of adjacent curves of po,6,6/ as can be seen in 
the figure. 

This is the general situation: ho o,T consists of r circles x 2 +y 2 = 1. Each suc- 
cessive y derivative introduces a curve between existing curves, each joining 
(-1) and (1,0). 

Proposition 14.32. All Pn, m ,r in 414.3.21 lie in P^. All coefficients of powers ofy 
in p n ,m,r lie in p [-1 ' 1] . 

Proof. The only part that remains to be proved is that the roots of the coeffi- 
cients lie in [—1, 1], but this is clear from the graph. □ 

We can show that po,m,r only meets the axis in ±1. We claim that 




r even, r n 
otherwise 



(14.3.3) 



This is an easy consequence of Taylor series: 
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so the left hand side of (|14.3.31 is the coefficient of z n in (x 2 + z 2 — l) n which 
yields the right hand side of dl4.3.31 >. 



Chapter 



Generating functions of linear 
transformations 

The goal of this chapter is to explain the use of generating functions to prove 
that linear transformations preserve particular spaces of polynomials. 

15.1 The idea of generating function 

Suppose T is a linear transformation and S is a set of polynomials. Generating 
functions are a method for proving that T maps S to S. At its simplest, there is 
a function G, the base generating function, and a set of functions S' such that 

T(G) €S' => T:S — >S. (15.1.1) 

We say that we have a Polya-Schur result if in addition 

T:S — >S =*> T(G) G S'. (15.1.2) 

The general method relies on expressing T(f) in terms of T(G), where 
f £ S. T(G) is the induced map on G, defined by T(x' y J ) = T(x') y J . We need 
the following data, where x = (%\, . . .,x&) and y = iv\, ■ ■ ■ ,Vd)'- 



s 


A set of polynomials in x. 


S' 


A set of functions in x and y satisfying 




f(x,y) G S' =► f(x,0) e S. 


G 


The base generating function. 


W 


A map from S to maps S' — * S'. 



These combine in the fundamental identity for f e S: 



(Wf)T(G)| y=0 =T(f) (15.1.3) 
481 
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Let's verify that the assumptions above imply dl5-l.lt . If f G S and T(G) € 
S' then W(f)T(G) lies in S'. Evaluation at y = gives an element of S, so 
T(f) G S. 



s 


S' 


G 




u 


Comments 


p 


P 2 


e -*y 








Pd 


P2d 


e -xy 




-Dy 






3x2 






D y 




MAd 


MA 2d 


nf(xi 


+ut) 






psep 


7 


(l + x) 


y 


A 




P 


jrpos 


e x 




D y 


multiplier transformation 




Jfl 


e x 




D y 


multiplier transformation 


P(n) 


P2 


(x + y; 


r 




bounded degree 



Table 15.1: Generating function data 

How do we find G and W? All we need is a dimension decreasing trans- 
formation U 1 . We define W(f ) = f (U), construct dual polynomials Pi(y) satis- 
fying 



I^J 




ul pj(y)| y=0 

and define 



The fundamental identity is formally trivial, since we only need to check 
on monomials: 



U 1 T(G)| y=0 = U 1 Y_ T(x J ) Pj (y) | y=0 = ^ij) | y=0 = T(x r ). 

The only difficult part is the verification of the mapping requirement 

f x g ^ f(U)g satisfies SxS' — ► S'. (15.1.4) 

In order to use the method of generating functions effectively we must be 
able to compute T(G), and have some knowledge of the members of S'. This 
isn't always easy. 

Proving a Polya-Schur result is harder, and often requires some analytic 
information in order to show that T(G) G S'. For instance, if we take T to be 
the identity it isn't even clear why T(G) = G is in S'. 

We now discuss, with simple examples, each of the cases in Table ll5.T] 
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Example 15.1. Transformations P — ► P 

We take S = P, S' = P2, and U = — D, with dual polynomials Pi(y) 
(— y )Vi!. The base generating function is 



00 , , 



and is in P2. We need to verify the mapping condition (15.1.4) . Let f (x) € P 
and g(x,y) G P2. We know that f (— D)g € P2, and therefore we can apply the 
method of generating functions to prove that transformations preserve all real 
roots. 



i^i 
x l i = j 



Lemma 15.2. T/ze Hadamard product x 1 © x' 
determines a map pv° s x P — ► P. 

Proof. Choose f £ P pos and define Tf (g) = f © g. If f = x n then it's easy to see 
thatT f (e~ xy ) = (— xy) n . It follows that in general T(e~ xy ] = f(— xy). Since 
f g ppos we ] <now faat f (— xy) e P2, which is contained in P2. □ 

It is interesting that the base generating function is equivalent to base gen- 
erating functions for each positive integer. 

Lemma 15.3. I/T* (1 - xy ) n e Y 2 for n = 1,2, . . . then T: P — ► P. 

Proof. Since f (— D y ) preserves P2 we know that f (— D y )T* (1 — xy /n) n is in P2. 
The result follows from the claim that 



limf(-D y )T(l XlJ 



= T(f) 



n 

By linearity it suffices to show this for f (x) = x k in which case 



.D^Tjl-^ 



y=o 



D "L(i)(|) T < %1 



n~ k T(x k ) — ► T(x k 



Alternatively see Corollary lll.891 □ 
Of course, the two approaches are equivalent: 

Lemma 15.4. I/T* (e"^ ) e P 2 then T„ (1 - xy ) n ) e Y z for n = 1,2, . . .. 

Conversely, if T(l) ^ and T*(l — xy ) n e V 2 for all n then T* (e~* v ) g P 2 . 

Proof. Since 1 — yz e P2 the operator 1 + zd y preserves P3. We have 



(l + zdy) n T,(e- 



xy ' 



2j-l) k z k T(x k )(^) = T*(1 

y=o 



Since (1 — xy/n) n — > e~ xy the second part follows from Lemma [l3.31| □ 
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Example 15.5. Transformation P d — ► P d 

The argument for P d is exactly the same as for P. If I = (ii, . . . , i d ) we let 



dyij \dy d 
The dual polynomials are 

^'■■■^H) ilWij 

Pi = — 

MI ii!---i d ! 

For an example, here is a Hadamard product result. 

Lemma 15.6. Suppose that i x , . . . , f d are in pv° s . The linear transformation 
9 i ► (fi(xi) • • • fd(Xd)) © g defines a map P d — ► P d . 

Proof. The generating function of the map is fi(— XiUj) • • • f d (— x d y d ) which is 
in P 2d since all f t e P pos . □ 

Example 15.7. Transformations "K\ — ► "K\ 

Since fxgn f(D)g maps MixM] — > "K x we know that T(e^) e ft 2 
implies T : "K\ — > "K\. 

We start wwith a Hadamard product result. 

Lemma 15.8. The map f x g^f*g determines a map P V ° B x JCj — > Jfj. 

Proof. If we fix f e P pos then the generating function of g g * f is exp f (xy). 
Since exp f e P pos we know that exp f (xy ) G J{ 2 . □ 

Remark 15.9. If T does not map "K\ — > "K\ but rather maps "K\ — > JfiUO 
then there might not be a Polya-Schur type result. 

Consider the Hadamard product T: f (x 2 + 1) © f. In coordinates this is 

T(^~ aix 1 ) = a + a 2 x 2 . 

Note that T(x) = 0, and since all at are non-negative, T : 'K\ — ► "K\ U 0. 

The generating function of T is (xy ) 2 /2 + 1 which is not in J{ 2 , so there is no 
Polya-Schur type result. In addition the induced map fails to map J{ 2 — ► IK 2 : 

T(x + y) 2 =x 2 +y 2 ^3{ 2 

In general, the map g^fx g has generating function exp f (xy ), and this 
is in % 2 if and only if exp f e P pos . 

Corollary 15.10. The following linear transformations map "K\ to "K\. 

Exponential x k H^^-x k 
k! 



Binomial x \^J X 

Laguerre x k h^Lic(— x) 
Laguerre REV x k ^L^ EV (-x) 
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Proof. See TableHU □ 

Next we have an inner product result. If f = Y_ ft My 1 and g = Y. 9i My 1 
then we define 

(f,g>=X i!f i9i 
This is clearly an inner product. The generating function of the map 

Tf : g i-» (f, g) 

is 

5>(*V)^ =£<f,*v>^ =L^* i} ^ - 

Since this is in if f G we conclude 

Lemma 15.11. Iff € Jf2 map g 1— * (f, g) satisfies "K^ x ^2 — ► Jtj. 

Sums of squares of polynomials in P pos are not necessarily stable, but the 
sum of the squares of the coefficients of a polynomial in ¥ v £ s is stable. Since 
P2° s C 3-C2 we can apply the inner product. 

Corollary 15.12. //£ fiMy 1 € 1%* ^ E f? is in "K\. 

For example, using the Taylor series Y. nr'TJ 1 yields that if f e pP° s then 




Corollary 15.13. IfT: P — > P, T: P v ° s — > P pos , and T preserves degree then 

For instance, if L n is the Laguerre polynomial then the transformation 
x n 1 ► L n (— x) satisfies the hypotheses of the corollary, so 

* — n!n! 

n 

Example 15.14. Multiaffine polynomials 

The unusual aspect of multiaffine polynomials is that the generating func- 
tion is a polynomial. There are two choices for U, and they lead to base gener- 
ating functions that are the reverse of one another. In each case the mapping 
properties follow from the properties for Pj. 

If we take U 1 = (— l)' I 'D yi then the dual polynomials are (— l)' ! 'yi and the 
generating function is 

d 

^x I (-l) |I| y I = n( 1 - x ^^ 
I 1 
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= Dy {1 d}XI with dual polynomials y{i,...,d}\i- The gen- 
ii 

Y_ X iyd d}\l = ri( x i + Vi) 

I 1 

Example 15.15. Transformations P sev — > P sep 

We do not yet understand this case. We have S = P sep , U = A y , with dual 
polynomials (y ) t /i!. The base generating function is 

^ .(y). 

Ix l ^ = (l + x)« 
i=o x ' 

Note that the base generating function is not entire, but is analytic for |x| < 1 
and all y. It is not clear what S' is, and we do not know how to prove the 
mapping property. 

Example 15.16. Polynomials with separated roots 

We start with the fact that A maps P sep to itself. The dual polynomials are 
(y) n /u!. Thus, the base generating function is 

v- Mi m ,u 

2_ x ^r = ( 1 + x ) 

However, in this case we do not know the correct space of functions S ' that 
contain the base generating function. 

Example 15.17. Multiplier transformations for P 

Let T : x 1 atx 1 be a multiplier transformation. If we treat this a map on 
polynomials then we know that if T{e~* v ) e P2 then T : P — ► P. However, 
since T is a multiplier transformation T(e~* v ) — T(e x )( — xy). It follows that if 

T(e x ) g f^°~ s then T(e-^ ) e P 2 , and hence T : P — ► P. 

Example 15.18. Multiplier transformations for "K\ 

Let T : x 1 1— » c^x 1 be a multiplier transformation. As above we know that if 

T(e x y) e 5T 2 thenT : Jfj — ► Jfj. NowT(e x «) = T(e x )(xy), and the only way 

for f (xy) to belong to "K 2 is if f e P^ 1 . 

Lemma 15.19. If the multiplier transformation T satisfies T(e x ) e P pos then 

1. T:P — >P 

2. T : [Hj — > Jtj 

Example 15.20. Polynomials of bounded degree 

In this case we have that T(G) € S' implies that V : S — ► S where V is a 
transformation depending on T. 



We can also take U 1 
erating function is 
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We take U = -D y and G = (x + y) n . 

U k T(G)| y=0 =^(-D v ) k T(x- k )y k Q| y=0 =(-l) k T(x- k )(n) k 

If we define V(x k ) = T(x k ) (n) k then by taking the revese and negating x we 
see that V: P(n) — ► P(n). 

In this case even the identity yields an interesting result: 

Lemma 15.21. The mapping x k i— > x k (n) k maps P(n) — > P(n). 
15.2 Multiplier transformations 

The main results of this section could be derived as special cases of the pre- 
vious section, but the proofs are easy. We are concerned with the generating 
functions of multiplier transformations x l i— > a,x\ If T(x l ) — a^x 1 then the 
generating function of T is 

OO , OO , ,| 

i=0 " i=0 

where 

f(x) = y 

* — i! 

We will consider the generating function to be a function T(e~ x ) = F(x) of 
one variable. Our goal is to prove that (A) (C) for multiplier transfor- 

mations. 

Theorem 15.22 (Polya-Schur). 

1. P pos is precisely the set of generating functions of linear transformations x 1 
QiX 1 that map P pos toP v ° s . 

2. P is precisely the set of generating functions of linear transformations x 1 i— > 
atx 1 that map P pos to P. 

Proof. If T: P pos — ► P pos then 

T(e x ) = lim T + 

Sine (1 + x/n) n e P pos we conclude from Lemma 14.161 that the polynomials 
in the limit are in P pos by assumption on T. Thus, the generating function 

is in P pos by Lemma [13.311 A similar observation shows that the generating 
function in the second case is in P . 

Conversely, assume that the generating function f is in P pos and choose 
g(x) in P pos . By Theorem 19 .871 the Hadamard-type product *' satisfies 



T(g)=F*'geP 



(15.2.2) 
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and therefore T(g) e P pos since all coefficients are positive. In the second case 
f S P and g e P pos , so we only get that T(g) G P. □ 

Remark 15.23. This result is known as the Polya-Schur Theorem II137II . Part (1) 
is sometimes stated in terms of maps P — ► P, but this can be deduced from 
the following lemma. 

Lemma 15.24. If T: P v ° s — ► pp° s / s a multiplier transformation, then T extends 
to a linear transformation P — ► P. 

Proof. Since the generating function F of T is in P pos , the result follows from 
H15.2.2I I and TheoremUjSZl □ 

Here's another consequence that can be found in [137J. 
Lemma 15.25. If f € P has all positive coefficients then f € P pos . 

Proof. If f e P then consider the linear transformation given by | |15.2.2|| . If 
g e ppos then Tg e ppos since f has aU positive coefficients. Thus, f e pP°*. 

□ 

Lemma 15.26. If~T{x x ) — cux 1 and T( (1 + x) n ) e P for infinitely many n then 
T(e x ) e P. 

Proof. If we substitute x/ n for x then we see that 

lim l(l + -Y =T(e x ) eP 

n^oo V TL/ 

By Theorem ll5.22l we are done. □ 

If T is as in the last lemma, then since (x + l) n <(x + l) n_1 , we know that 
T(x + l) n <J T(x + l) n_1 . In [42J it is shown that the interlacing is strict. 

15.3 Linear transformations P — > P 

Table 115.21 lists some examples of generating functions of linear transforma- 
tions that map P to P. The first four are elementary, and the rest are standard 
formulas (e.g. [72]). In the Hadamard product and f (xD) we must choose 
f 6 F ± . In the table Io(z) is the modified Bessel function of the first kind and 
Jo(z) is the Bessel function. In the Multiplier entry we are given a linear trans- 
formation of the form T[x x ) — tux 1 which maps P to itself. The generating 
function of T in the sense of the previous section is f (x) = Y_ al TT an d is in P. 

Example 15.27. If T: P — ► P doesn't preserve degree and the sign of the 
leading coefficient then the generating function might not be in P2. Consider 
T(g) = G(D)f(x). We have seen that this linear transformation doesn't ex- 
tend to a map P2 — ► P2 when f (x) = x. An easy calculation shows that the 
generating function is (x — u)e~ xy , which isn't in P2. 
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Name 
Affine 

Derivative 

Derivative 

Derivative 

Exponential 

Hadamard 

Hadamard' 

Hermite 

Laguerre 

Laguerre REV 

Laguerre 

Multiplier 



i=0 

oo 

L 

i=0 

oo 

i=0 

oo 

i=0 

oo 

L 

i=0 

oo 

L 

i=0 

oo 

L 

i=0 

oo 

L 

i=0 

oo 

L 

i=0 

oo 

i=0 
oo 

L 

i=0 

oo 

L 

i=0 



(ax + b) 1 



ix 1 - 1 
f(DK 

i! 

f(x) *x l 
[x r * x r = r!x r ] * x 1 
Hi 

U(-x) 
U REV (-x) 



QiX 1 



i! 

-y) 1 



i! 

-V) 



i! 

-y) 1 



i! 

-V) 



i! 

-y) 1 



i! 

-y) 1 



i! 



i! 

-y) 1 



i! 



i! 

-y) 



i! 

-y) 1 



= e -(ax+b)y 



i! 



= (EXPf)(-xy) 

= f(-xy) 

= er 2 *y-y 2 

= e-^J (2Vxy) 

= e-*«J (2Vy) 

= e-^L n (x + ij) 

= f(-xy) 



Table 15.2: Generating Functions for transformations P — ► P 
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Remark 15.28. The generating function of the linear transformation f i— ► f + ocf 
is (1 + ocy)e~ xy . Since this is in P2 it follows that f + ocf is in P for all f e P. 
This is another proof of Rolle's theorem (Theorem ll.lfr . 

Example 15.29. What happens if T doesn't map P to itself? The generating 
functions should be functions that are not in P2. Let's look at some examples. 
The Hermite transformation T(x l ) — Hi maps P to itself. Since this is not an 
affine map we know that the inverse does not map P to itself. Using identities 
in 11461 we can find the generating function of T _1 : 

n=0 v ' v ' 

n=0 v 7 

-2xy+y 2 

= e 4 

This is not in P2 since substituting x = gives e y2 ^ 4 which is not in P. 
Next, consider T(x l ) = (x) i which maps P pos to itself. We can compute 



f— ul 1 I — \iV 

T4e- xlJ ) = ^T(x i )£^ = ^x(x + l)---(x + i-l)^^ = (l+y)- x 

i=0 " i=0 

Although the generating function has a simple form it isn't in P2. This is 
because it is not an entire function: T* (e~ xy ) (l,y ] = (1 + has a pole at 

y = -i. 

An example of a different sort is given by the generating function of the 

q-derivative. Recall if A(x) = qx then the q-derivative is — The 

x(q - 1) 

generating function is 



x(q-l) 

which is just the q-derivative of e~ xy . It's not in P since substituting y — 1 
gives a function that has complex roots. 

Example 15.30. IfT(x l ) = Ht, then using the generating function we can show 
that T(P) <f_ EXP(P) without exhibiting any particular polynomial in EXP(P) 
that is not in T(P). If T(P) C EXP(P) then EXP^^fP) C P. We will show that 
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the generating function of EXP : T isn't in P2. We compute 
Y_ EXP" 1 TV)^-^ EXP" 1 (X 1 ^)^ 



= exp- 1 e-^y-y 1 



1 +2xy 



This last function is not entire. 

15.4 Linear transformations P — > P - general properties 

In this section we establish general properties of generating functions. In the 
next section we will apply some of the results to show that various linear 
transformations preserve P. 

If a linear transformation T has generating function Y(x,y), then it is easy 
to compute the generating function of the Mobius transformation Tm (§ 1616. 7b , 
where M : z 1— > ^ . To begin with, there is a very useful relationship be- 
tween the generating functions of a linear transformation T and the transfor- 
mation T yz . Recall (S l6l67t that if T(x n ) = p n (x) then T 1/z (x n ) = p n REV . The 
generating function of Tjy e is 



More generally if M is a Mobius transformation M(z) = ff^j then we 
define T M (x n ) = (cx + d) n T(x n ) ( cx+d )' an< ^ me generating function is 

^ (CX + d] T (^xTdj — = F UxTd' (cx + d]y 

If we first compose with an affine transformation then the generating func- 
tion has an exponential factor. 

Lemma 15.31. Suppose that T is a linear transformation with generating function 
¥(x,y). If S(f) = T(f(ax + b)) then the generating function ofS is e- by F(x, ay). 
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Proof. A computation: 



n k 



n! 



z — k! 4- n-k ! 

k n>k 



k! 

k 



ay) 



□ 



The next result shows that the coefficients of the polynomials defining a 
root preserving transformation are highly constrained. 

Lemma 15.32. Suppose that T : P — ► P. Choose a non-negative integer r and let dt 

be the coefficient ofx T in T(x 1 ). Then, the series Y-T=q di^r ' s zn P> an d x x i— > diX 1 
mapsP vos — >P. 

In addition, for any n t/ie polynomial Y_ (™) dix 1 is in P. 

Proof. The first sum is the coefficient of x r in the generating function of T. The 
second follows from Lemma [13. 181 □ 

In particular, the generating function of the constant terms of a root pre- 
serving linear transformation is in P . 

The leading coefficients are similarly constrained. 

Lemma 15.33. Suppose T: P — ► P, T: p pos — ► pp° s r j preserves degree, and 
the leading coefficient ofT(x x ) is c^.Then Y_ Ci^f € P, and x l i— > Cj.x l maps pi 305 fo 
itself. 

Proof. Corollary 19 .571 shows that T\/ z : P — ► P. The constant term of Ti/ Z (x x ) 
is Ci, so we can apply the previous lemma. □ 

We have seen (Corollary ll.501 that if T: x n f-n(x) maps P — ► P, then 
the map x n i — > c n x n , where is the leading coefficient of fn, also maps P to 
itself. Evaluation has similar properties. 

Corollary 15.34. Suppose that!: x n i— > f n (x) preserves degree and maps P — > P, 
then for any oc^Rthe multiplier transformation 
x n i-> f n (a)x n mapsP vos — ► P. 
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Proof. If the generating function (e xy 6 P2, then evaluation is in P, and so 

from which the conclusion follows. □ 

If we have two linear transformations S, T with generating functions in P2 
then we can multiply their generating functions by Y. cLiy n ~ x z n ~ l : 

(fs(x')i^) (f> l >tf) (irf-^) 

If X.o a i xi e pP ° s then tne third factor is in FV°* 2 so all the factors are in P 3 . 
The coefficient of y n z n is Y. Ql S ^ t^i" 1 ^* • Consequently the linear transfor- 
mation x n h-> Y_ at ^"V,^* 1 ' maps P pos to P. We can remove the factorials. 

Lemma 15.35. Suppose S, T are linear transformations with generating functions in 
P 2 . The map x i ^ (-l) i S(x i )T(x i ) maps P v ° s — > P. 

Proof. If Y_ o-ix 1 e P alt then we can apply Lemma 110.631 Simply observe (as 
above) that since 

n 

= Xa i (-l) i S(x i )T(x 1 ) 

and the differential operator maps P3 to itself so the right hand side is in P. □ 

We can use the relationship between linear transformations P — ► P and 
elements of P 2 to construct new linear transformations from old. We start with 
a linear transformation, determine the corresponding element of P2, manipu- 
late it to create a new element of P2, and then convert back to a linear trans- 
formation. Unfortunately we don't know many ways to get new elements of 
P2 from old ones. Differentiation in P2 yields some simple transformations. 
Suppose that 
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f(x,y)=^T(x l )^ 



and so — corresponds to g i— > — (Tg) 

3f - r_n.u-i 



9u (1-1) 



and — corresponds to g i— * — T(xg) 
3D 



Multiplication in P2 gives a convolution of linear transformations. 

Lemma 15.36. Let S, T : P — > P preserve degree and the sign of the leading coeffi- 
cient. Then the linear transformation below also maps P — ► P. 

Proof. Multiplying two elements in P2 yields an element of P2: 

1*^) {L^tf) -£ (t (») T( ,. W x-,) 

which establishes the lemma. □ 

Remark 15.37. This really is a property of linear transformations that satisfy 
induction. Indeed, if S,T satisfy induction, then since W(f) = (S^T^ffx + 
y))(x,x) we see that W(f) e P. The example S(g) = g(D)x, T(f) = f yields 
W(l) = 1+x 2 . This is not a counterexample, since S does not satisfy induction. 

Remark 15.38. We revisit the characterization results of § 1616.11 If T is a degree 
preserving linear transformation such that Tf and f interlace for all f G P, then 
the linear transformation T a (f ) = T(f ) + af maps P to itself. Thus, for all a, the 
generating function of T a is in P2. If G is the generating function of T, then the 
generating function of T a is G + oce~" v . Consequently, G and e~ xy interlace. 

We know that such linear transformations are given by Tf = axf + bf ' + cf 
where a and b have the same sign. The generating function of this transfor- 
mation is (ax + by + c)e~ xv . This makes it clear why we must have the sign 
condition. See Question 1 1681 

Since we do not have a characterization of functions in P2, this point of 
view does not give a proof of the characterization theorems, just an under- 
standing of them. 
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Lemma 15.39. If T is an onto linear transformation defined on polynomials in one 
variable and the linear transformation S: T(x x )yi t— * T(x^)y x maps P 2 to itself then 
T is an affine transformation. 

Proof. Since S(TV)y j ) = T[x>)y x by linearity we see S(T(f(x))y j ) = T(x j )f(y) 
for any polynomial f. Choosing f = T _l (x 1 -) shows that S(x x yi) — 
T(x')T _1 (y 1 ). The compositions 

P > P 2 5 >■ P 2 >■ P 



f (x) f (x) 1 > T(f ) V 1 > T(f ) 



f(x) f(y) 1 T -1 (f) T-!(f) 

show that both T and T~ : map P to itself, and consequently T is affine. □ 

15.5 Applications: Linear transformations P — ► P 

We now have a powerful tool for showing that a linear transformation maps 
P to itself. We use the generating function to show that many linear transfor- 
mations preserve roots. 

Example 15.40. If we start with an element of P 2 then we get a linear trans- 
formation. For instance, if f £ P, then f (x + y) £ P 2 C P 2 . The Taylor series of 

f (x + y) = Y_ f (i » = Jj-lfr™ M ~ 



x 



and consequently the linear transformation x 1 1— > (— l) 1 f is a map P — ► P. 
Precomposing with x 1— > — x shows that x 1 1— ► f maps P — ► P. 

In Corollary l7.47l we showed that the Laguerre transformation maps P alt 
to itself. Now we show that it actually maps P to itself. 

Lemma 15.41. The mapping x n 1— > L n (— x) (the n-th Laguerre polynomial) maps P 
to itself. 

Proof. It suffices to know that the generating function of the transformation is 
in P 2 . From Table \152\ the generating function is Jo(2^/xy) where the Bessel 
function ]q(z) is given by the series 



k=0 
°° f— 1l k 

k=0 
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The Bessel function has a product formula 




) 



and hence 



00 / 

n 



2xy 



) 



Since the factors of this last product are all in P 2/ it follows that Jo(2y / xy ) is in 

P poS 2, and hence the generating function given of x n 1— > L n (— x) is in P poS 2. 

□ 

Since the linear transformation T: x n 1— * L n maps P to P, we know that 
T _1 does not map P to itself. It's easy to verify that 



Since EXP 1 doesn't map P to itself, it isn't surprising that T 1 doesn't either. 

Lemma 15.42. If n is a positive integer, then the linear transformation x l 1— > L\[—x) 
maps P to itself. 

Proof. From Table H5^2l the generating function F(x,y ) of the transformation is 
e v L n (x + y). Since F(x, y ) € P2, the linear transformation maps P to itself. □ 

We can apply Mobius transformations to show that the reverse of the La- 
guerre transformation also maps P to itself. 

Lemma 15.43. The linear transformation S(x n ) = L n REV (— x) maps P to P. 

Proof. The generating function of T(x n ) = L n (— x) is e y J (2 v /xy). Since S — 
Tl/z we know the generating function G(x,u) of SisF(l/x,xu) = e~* v J (2^/y). 

Now e-* y e P 2 , and J (2,/y) S P, and hence G(x,y) G P 2 . □ 

The next result is an immediate consequence of Lemma [15. 351 

Corollary 15.44. If H n is the Hermite polynomial, and L n is the Laguerre polyno- 
mial, then the following linear transformations map pv° s to P. 



T 1 = EXP 1 o T o EXP 



• x n ^ H n (x) 2 

• x n L n (-x) 2 



• X n ^ H n (x)L n (-x) 
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Lemma l6.28l characterizes the linear transformations of the form 
T(XtJ = nr=i( x + a i) mat ma P ^ to itself- We conclude 
Lemma 15.45. The only choice of constants at satisfying 



oo / n 



n=0 \i=0 * / 

zs ai = a2 = CI3 • • • . 

We can use generating functions to get results converting complex roots 
by applying appropriate affine transformations. See 1 261 . 

Lemma 15.46. Consider the linear transformations 

T: f(x) i-ocf(x + l) + (x-l)f(x-l) 

S: f (x) ^{ix + hi{x -x) + [xx - hi{x + i) 

The following diagram commutes 

P S >P 

A 

XI— x) Xi—>IX+^ 

pj+iR 1 s> pj+tR 

Proof. It is easy to verify that the diagram commutes at the element level, so it 
suffices to show that S maps P to itself. We compute the generating function 
of S: 



oo , 



n=0 



1 

„ ) „x-i) Tt + (ix--)(x + i) 



= - ^_ e l( - 1+lx)y + e r(1+lx)u + (2i) e l( - 1+lx)y x+ (2i) e l(1+lx)y x 
= i (2x cos(y) + sin(y)) e~ xy 

Now Example [133] shows that 2xcos(y) + sin(y) S P2, so all factors are in P2, 
and hence S maps P to itself. 

□ 

We can determine more transformations that are sums of shifted argu- 
ments using Lemma 15.101 

Lemma 15.47. If f (x) = Y. a i %x zs ' n P (_1,1 ' and we define 
T(g) = ^ Q k (g(x + ki) + g(x - la) ) 

k 

then T maps P to itself. 
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Proof. It suffices to compute the generating function of T. Notice that the gen- 
erating function of the transformation x n i— > (x + a) n is e _ICV_aiJ . Conse- 
quently, the generating function of T is 

/g-kyi i gkyi\ 

ie-*y Y. a * t — = 2e ~ xy Z Q k cos ( k y) 

k ^ ' k 

Since f(x) 6 P A we know that ^Qkcosfku) e P - see the proof of 
Lemma [20.761 Thus the generating function is in P2. □ 

For example, we can take f = (x + 1 ) n . The lemma shows that if g 6 P then 

(g(x + ki) + g(x-ki)) eP- 

Now we have properties of the Legendre and Jacobi polynomials. 
Lemma 15.48. The linear transformation x k 1— > P] C (x)/k! defines a map P — > P. 

Proof. Since Jo(x) e P p 8 . it follows from .1 that f n f xu +u )f n fxu -u 1 e P 2 . 
Consequently the generating function for the transformation is in P2. □ 

We can generalize this to Jacobi polynomials. We use the Mathematica def- 
inition of Pn'^(x). 

Lemma 15.49. For A, [i > — 1 the linear transformation 

n! 



defines a map P — ► P. 

Proof. From (S3 (2.1.A)] we let 



1 r(A + i) zk 



k=0 1 



then we have that 



f A (xz - z)f R (xz + z) = £ — — P^(x) 

Now f A is a modified Bessel function and is in P pos , so we follow the argu- 
ment of the previous lemma. □ 
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15.6 Applications: the diamond product 

We can show that a bilinear mapping T satisfies T : P 1 x P — > P by showing 
that for every f € P 1 the linear transformation Tf (g) = T(f, g) maps P to itself. 

We now show that the diamond product based on the Hermite polynomi- 
als maps P x P — ► P. The proof shows that the generating function of the 

transformation is in P alt 2 by using some special function identities for Her- 
mite polynomials. 

Recall (§ 17111.14) that if T is a linear transformation then the diamond prod- 
uct is given by f g = T _1 (T(f) T(g)). 

Lemma 15.50. Suppose T(H n ) = x n where H n is the Hermite polynomial. The 
diamond product defines a mapping P x P — ► P. 

T 

Proof. We fix f G P, and show that the generating function of g f g is in 
P2. We recall two identities that can be found in [ 146 1 

min(n,k) . . . . 

H n (x)H k (x) = J_ 2'Q^.Jj!H n+k _ 2j (x) 

We simplify matters by initially setting f = x n . The generating function of 
g x n g is 




'min(TL,k 
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Applying the identity 415,6.11 yields 
H n+k - 2j (^] = (l) ^ T(x n+k - 2! ) and so 



2 1 \2 



k=0 j=0 

'2x + u 



e 



2 

By linearity, we conclude that the generating function G (x, y ) for g i— > f g is 



e -x Vf f 2x +y 



V 2 



Since G(x,u) € P2, and so the diamond product maps P x P — > P. 

□ 

The Hermite diamond product has a simple description using the Hermite 
basis, and using this description we can find some interesting linear transfor- 
mations. The diamond product is simply H n H m = H n+m . Upon fixing 

m = 1 the linear transformation f f Hi defines a map P — > P. Us- 

T 

ing the definition of the diamond product above, this shows that the linear 
transformation H n 1— * H n+ i determines a map P — > P. Since H rl <04x 2 = 

H n (H 2 + 2H ), the map H t ^ H i+2 + 2H t maps P — > P. 

15.7 Applications: generalized Hadamard products 

In this section we characterize several generalized Hadamard products. We 
have seen that the two Hadamard products x 1 * x 1 = x 1 and x 1 *' x 1 = tlx 1 
map P x p pos — > p. The next result characterizes such general Hadamard 
products. 

00 ,■ 

¥ X 

Proposition 15.51. Suppose that g(x) = 2_ aiT 7M' The generalized Hadamard 
product 

QtX' i = j 

otherwise 

defines a map P © P v ° s — > P if and only ifg(x) e P~v°~ s . 



i=0 
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Proof. Choose f = Y. ^H* 1 e P pos , and consider the map Tf : g i— > f © g. We 
will show that the generating function of Tf is in P2. Compute: 



n! 1 — n! 
= }_b n a n - 

= (f*'g)(-xy) 

Since f G P pos , and g G P^, we know that f *' g G P pos , and consequently 

(f*'g)(-xy) eP^ 2 . 

Conversely, since the Hadamard product extends to P the following is in 

P^: e x © e x = V a~ □ 
* — t!i! 

We can generalize this to products defined by 



x 1 x x * x • • • x x 



cmx 1 t = \i — i 2 • • • = id 
otherwise 



where all ch are positive. A similar argument shows that a necessary condition 
that this product determines a linear transformation (p pos ) d — > ppos j g 



. . (i!) d G 



i=0 

Next we consider maps P^" 3 — > P pos that have the form 

x y J i—* < (15.7.1) 
1 otherwise 

If at = 1 this was called the diagonal map diag, and if en = i! this was called 
diagi. See Theorem |13.9l 

Lemma 15.52. The map H15.7.11 defines a mapT: Y p ° s — ► P pos iff G = £ G 

Proof. Since T(e x e y ) = £ a iffn the condition G G P pos is necessary. From 
Theorem |13.9| we see that ch = i! determines a transformation that satisfies the 
conclusions of the theorem. Consequently if G G P pos then we can express T 
as a composition 

pP os 1 ^ ppos 




p 



pos 



The map x 1 i-> f-x 1 defines a map P pos — ► P pos since L if T" ^ P by 
hypothesis. □ 
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Note that diagi(f(x)g(y)) = f*'g. Consequently, Theorem l9,87l in the case 
both polynomials are in P pos , follows from this lemma, since we can use the 
embedding P pos x P pos — > p£ os given by fxg^ f(x)gCy). 

We next show that the only products of the form 

x 1 x x> k> Qi +j x i+j (15.7.2) 
that determine maps P x P — > P are compositions. 

Lemma 15.53. A product (|15.7.21 determines a map T: P x P — > P zjff ^ a Ti^r e 
P. This is equivalent to saying that the product factors through P: 

P x P - JsP 




multiplication 

multiplier transformation 



p 

Proof. It suffices to evaluate T(e x , e x ): 



°° x n n! 

n=0 i+)=n ' 

f (2x)- 



n! 

n=0 



□ 



We can only state necessary conditions in the more general cases. The 
proofs are similar to the above, and are omitted. 

Lemma 15.54. If the map l|15.7.3t induces P2d — ► then Y. a Jrw € Pd- If the 
map ||15.7.4| | induces P2d — ► Ud then Y_ &ijr € Pd- 

xV^{ ajx/ 1 = 7 (15.7.3) 
1 otherwise 

xV i ► a I+ ,x I+J (15.7.4) 



15.8 Generating functions on P(n) 

The utility of generating functions is that if a certain function (the generating 
function) computed from a linear transformation is in some space, then the 
linear transformation has some nice mapping properties. In this section we 
look at linear transformations such as x T i— ► x n ~ T that are only defined on 
P(n). The following proposition does not characterize such generating func- 
tions, since there is a necessary factorial. 
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Proposition 15.55. Suppose that T is a linear transformation of polynomials of de- 
gree at most n such that T*(x+y) n e P2. Then, the linear transformation x x i-> T( t * 1 
maps P(n) to itself. 

Proof. Choose g(x) = Y. a i xl e P( n )- Since g Tev (^) maps P2 to itself, we see 
that 



g rev (^)T4x + y) n 



y=o 



8 ,tv '^)|.(i) T h' , )« n - 1 

KM 



g-(T-hj T 
9y 



V=0; 



^^T(x l )(n-i)!oi 



i=0 

n!To EXP(g) 



and hence T o EXP(g) e P(n). 



□ 



The converse needs an extra hypothesis. 



Proposition 15.56. Suppose T: P(n) — ► P(n). If T* (x + y ) n satisfies the homo- 
geneity condition then (x + y ) n e Pi[n). 

Proof. We only need to check substitution, and if we substitute a for y then 

T*(x + y) n (x, a) = T(x + a) n e P. □ 

Example 15.57. Here are a few examples. 

1. Mobius transformations. Suppose T(x x ) = (ax + b) x (cx + d) n_l . Then 



T*(x+y) n ( n )(ax + b) 1 (cx+d) n - i y n_i = ( (ax + b)+y(cx + d) 

i=o 

2. Polar derivative. If T(x T ) = (n — i)x x then 

T*(x + y) n = f_ ( n ) (n - i)xV _l = ny(x + y) n 



3. Reversal (with a negative sign). If T(x x ) = (— x) n 1 then 

T*(x + yr = Y_ r^(-l) n - l x Tl - V"' = d-xy) T 



i=0 
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4. Hermite. Let T(x n ) = H n . From l|6, 8.211 we know that 

T*(x + y) n = T(x n )(x + 2y) = H n (x + 2y) e P 2 . 

This implies that x n i— > H n /n! maps P(n) to itself, but in this case we 
know T maps P(n) to itself. 

5. In Lemma 18.301 we saw that the linear transformation T: x k i— ► 
( x + n — k ) maps P alt (n) — > P(n). The generating function 



T*(x + y) n = Y_ (1) ( x + n-k Lu"' 1 = (y + l) n (x) n 

i=0 ^ ' 

is in P 2 (2n). Proposition [1535] only allows us to conclude that 

x k ^ (* + = ^ n maps P(n) — > P(n). (15.8.1) 

Since it is not true that x k i— > ( x + n — k ) maps P(n) to itself, we see 
that the factorial in Proposition [1535] is necessary. 

15.9 Higher dimensional generating functions 

If T: Pd — > Pd is a linear transformation, then its generating function is a 
function of 2d variables: 

Here are some examples. 

The generating function of the identity transformation is e~ xy . 
Suppose that T: P — ► P and let F(x,y) be the generating function of T. If 
we define T* [x l yi ) = T(x l )y> then the generating function F* of T* is 



F,(x,y,u,v) = Y_ T ( xt )l) 



-u) 1 f-vV 



J j! 



More generally, if we have another linear transformation S : P — ► P with 
generating function G(x,y) then the generating function of x*y' i— > T(x x )S(u' ) 
is F(x,u)G(y,v). 

If T: P 2 — > P 2 is a linear transformation and a, b are positive, then S(f) = 
a^T(f ) +b^T(f ) satisfies T(f) < S(f ). If T has generating function F, then the 
generating function of S is aF x + bY y . 
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The generating function of the linear transformation g i— ► f(3 x , 3^)g has 
generating function f (u, v)e~ xu ~ vv . This follows using linearity from the cal- 
culation 



i! — ~ v. I i 1 — a " j! 

= (u T e~ xu ) (v s e- y "j 

In general, the generating function of g i— > f(3 x )g is f(x)e~ x ' y . Similarly, the 
generating function of f h-> f (3 x )g is g(x + y). 

Proposition 15.58. IfJ is a linear transformation on and T*(e~ x ' y ) e P2d #zen 
T:P d ^P d . 

Proof. Since f(3 x ) maps P2d to itself the proof is the same as the case d = 1. □ 

Corollary 15.59. IfSisadbyd positive symmetric matrix, and Hj(x) is the corre- 
sponding Hermite polynomial then the linear transformation x 1 1— > H f (x) maps to 
Pd- 

Proof. The generating function of the Hermite polynomials dl3-5.ll > is 
exp(— ySy* — 2ySx* ) which is in P2 d . We can now apply the Proposition. □ 

The proposition implies that differential operators preserve Pd- 
Lemma 15.60. Iff G then the linear transformation 

T:g^ f(3x)g 

maps Pa to itself. 

Proof. The generating function of T is f(y)e _x y which is in P2d- □ 

Table [1531 lists some generating functions in higher dimensions. Note that 
these transformations do not all map P d to itself since not all of the generating 
functions are in P2d- The operator EXP X is EXP Xl • • • EXP Xd , and the generalized 
Hurwitz transformation is defined as 

_^ !j | x I/2 if all coordinates of I are even (15 91) 

I otherwise 



15.10 Higher dimensional multiplier transformations 

The Polya-Schur theorem does not generalize to two variables. Although we 
will see that all multiplier transformations are products of one dimensional 
multipliers, there are examples of transformations that do not map P2 to itself, 
yet have a two variable generating function in P2. 

118 1 observed that all higher dimensional multiplier transformations are 
just products of one dimensional maps: 
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Name 

differentiation 

Hermite 

Hadamard 

Hurwitz 

identity 



i 

Y_ H iM 

I 

Y_ f M * x 1 



i 



-y) 1 



I! 

-y) 1 



I! 

-y) 1 



I! 

-y) 1 



I! 



-y) 1 



f(y)e- x -y 

e -ySy'-2ySx' 

EXP x f(-xiy 1/ ...,-x d -y d ] 

cosh(ui^x7) • • • cosh(y d% /x7) 

p-x-y 



Table 15.3: Generating Functions in higher dimensions 



Lemma 15.61. The following are equivalent: 

1. T : x 1 — ► otjx 1 maps P d to z'fse/f. 

2. T is a product of one-dimensional transformations. 

Proof. We first assume that d = 2, so assume that TfxHj' ) 
fl8l we apply T to two test functions 



iXij. Following 



TxV(l +x)(l + y) = xV(«i,j + a i+ i,jx+ ay+iy + ai+ij+ixy) 
TxV(l +x)(l —y) = x l y'(ay + «i+i,jX- ay+iy - oti+ij+ixy) 



By Proposition II 1 .261 or Lemma [20. ll both of the factors are in P2, and by The- 
orem|93l3] 

OyOi+y+i - at+ijay+i ^ 
— ayOi+i/j+i + ok+ijay+i > 

and therefore 

OyOi+lJ+l = Oi+l,jOy+l (15.10.1) 

We now consider the support S of T - that is, the set of all (i, j) such that 
ay 7^ 0. If r is a non-negative integer then the composition 



T x"u' 

" P 2 — ~+ P 2 ~ 



P 



determines a map P — ► P. Since this map is a multiplier transformation 
in one variable we know that the non-zero coefficients have no gaps, and so 
form an interval. Thus, all intersections of the support S with a horizontal or 
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vertical line are intervals. The identity (|15. 10.lt shows that we can not have 
the configurations 






















in the support, so it follows that the support is a rectangle. 

We may assume that 1x0,0 = 1- Using the recursion 1 115.10. 1\ and induction 
yields otij = 1x1,0 «o,j • It follows that T is a product of the one dimensional 
transformations 

T r :x n 1 > a n ,ox n T c :y n ^ a , n y n 

The case for general d is no different; we use the fact that the intersection 
with every d — 1 dimensional face is a product to conclude the support is a 
product. □ 

Example 15.62. Here is an example of a multiplier transformation that does 
not map J? 2 — * P2 but whose two variable generating function is in P2. The 
two variable generating function of T: x x y' 1— ► (i+ j) 2 xhj' is 

00 (■ _i_ -)2 

Y_ ^TT-*V =e x+ ^(x + u)(x + u + l) 
y=o 10 " 

which is in P 2 . We know T maps P!^ to itself, but T does not map P 2 to itself. 
If 



f = -193 + 43 x + 21 x 2 + x 3 + 262 y + 196 xy + 14 x 2 y + 448 y z + 64 xy 2 + 96 y 3 
then Tf (— 2,y ) has complex roots. Note that the usual generating function is 

Y (i + j) 2 xV ^~ U ?'?r V ^ - e- ux - vy (ux + vy ) (ux + vy - 1) 
* — ii 

y=o J 

and the latter expression is not in P4. 

Remark 15.63. In Corollary 111.1141 we saw that the linear transformation 
xHj' 1— * . - j maps Pj 05 to itself, yet it does not map P2 to itself. The gener- 
ating function is 

r- x*y' xe x — ye y 
+ x-y 

This can be seen as follows. If the generating function is s, then the terms of 
~ (s — 1) largely cancel, leaving only ^e x ~ e v . This generating function is not 

in P2 since it is not an entire function. 
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Just as in one variable (Lemma 1 1.491 a linear transformation T: P2 — ► P2 
determines a simpler linear transformation. 

Lemma 15.64. If T: P2 — > P2, we can ron'fe 

T(x 1 y') = gij(x,y) = cyx l y' + terms of lower degree 
then the linear transformation S(x T y' ) = CyxHj' aZso maps P2 — > Pa- 

Proof. Choose f = ayx x y' in P 2 . Substitute x/a for x, y/|3 for y and apply 
T. Next, substitute ax for x and (3y for y. The result is that the following 
polynomial is in P2: 

Y_ aijO^P'gyfx/ocy/P) 

y 

Next, using the fact that 

lim + lim + a l (3' g tj (x/a,y/|3) = CyxV 

we see that 

lim + lim + Y_ atja l (3'gij(x/a,y/[3) = Y_ OijCijxV = S(f) G P 2 

y u 



□ 

Example 15.65. The two variable generating function of x T y s 1— * f (r + s)x r y s 
can be easily found. 

£(i + j) u ^ = e* + ^T- 1 (x+y) n 

where T: x n = (x) n , and T^ 1 (x r y s ) = T _1 (x T )T _1 (y s ). By linearity we find 
the generating function 

Z f ( x + y)^~ = e x+V T**n* + y) = (e^f^x + y) 

where the last equality follows from Example l6.53l It follows that if f S P pos 
then this generating function is in P2. We saw that the linear transformation 
corresponding to f (x) = x 2 does not map P2 to itself. 
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15.11 Differential operators 

We continue the investigation of differential operators. We first compute the 
generating functions of differential operators acting on P, extract some prop- 
erties, show some eigenpolynomials are in P, and then extend some of the 
results to Pa- 

Consider the linear transformation T defined on polynomials of one vari- 
able 

Tl 

T(g)=^f i (x)g( i '(x) 

i=0 

We define f(x,y) = ^fiMy 1 , and write T(g) = f(x,D)g. The generating 
function of T is 



2_T(x')— ^=^_^_ft(x)D x>-— 

j=0 '' j=0 1=0 '' 



i=0 J=0 
n 

= ^ft(x)D l e-^ 

t=0 

n. 

i=0 

= f(x,-u)e- x « 

Thus, we have 

Proposition 15.66. Iff G P2 then f (x, — D) maps P to P. Conversely, if 

1. f(x,-D) maps P to P. 

2. The coefficients of the homogeneous part o/f (x,y ) are all positive, 
then f G Pz- 

Proof. The generating function is f(x,u)e~ xy G ¥2- In Lemma 17.141 we saw 
that f (x, a) G P for all a G R, so f G P 2 - □ 

Remark 15.67. For example, if we take F = x + y then the linear transformation 
is f 1 ► xf f '. We know that this is in P since xf < f < f implies xf — f <f. 

Corollary 15.68. Suppose that!: P — ► P. Tfoe Zmear transformation 



g^^ T (x k )^--^(-l) k 



maps P 
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Proof. If f(x,y) is the the generating function of T, then f (x,y) G P 2 - The map 
isg(x) h->f(x,-D)g(x). ' ' □ 

In IITOl they show that for n sufficiently large there is an eigenpolynomial 
f (x, D) of degree n. We now show that the eigenpolynomials are in P if f G P 2 , 
and f satisfies a degree condition. 

Lemma 15.69. Choose f (x,y) G P 2 (d) 

f(x,y) = f (x) + fi(x)y + f 2 (x)y 2 + • • ■ + f d (x)y d 

where the degree of f t is i. If n zs sufficiently large then there is a polynomial p e P(n) 
and constant A suc/z that f (x, D)p = Ap. 

Proof. Since f (x, D) maps P to itself, and preserves degree, we will apply 
Lemma 17.851 It suffices to show that there is a dominant eigenvalue, which 
is the same as finding a dominant leading coefficient. 

The r-th diagonal element of M is the coefficient c r of x r in f (x, D)x r . De- 
noting the leading coefficient of fi by Ct, 

d 

c d =X c i (l)i = C d (r) d + 0(r d - 1 ). 

i=0 

For n sufficiently large, c n is the largest eigenvalue of M. 

□ 

Remark 15.70. The identity e - (a * +3 « )2 (x + y) n = (x + y) n shows that the 
operator e~' 3 * +3lj ' 2 has polynomial eigenvalues of every degree. 

We can generalize some of the above to more variables. Suppose that 
f (x, y) G Pd+e and we define a differential operator 

T(g(x)) = ^f I (x)D I g(x) (15.11.1) 

It is easy to see that the generating function of T is simply f (x, y)e~ x y . 
Proposition 15.71. Suppose 

1. f(x, -D) maps P d to Pa- 

2. The coefficients of the homogeneous part off (x,y) are all positive. 

then f G Pd+e- 

Conversely, if f(x,y) G P 2 d then f (x, — D) maps P<j to itself. 

Proof. If suffices to show that f(x,y) satisfies substitution. The proof is the 
same as Lemma[7T4] applyf(x,— D) to e~ a ' x to conclude that f(x, a) e~ ax G Pd- 
Multiplying by e a x finishes the proof. 

The second part follows form the proposition. □ 
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Which polynomials f (x,y) determine operators f (x, D) that map P pos — ► 
P, and don't map P — ► P? Here's a necessary condition. 

Lemma 15.72. Suppose that f(x,y) determine an operator f(x, D) that maps 
pp°s — > p if-f(x,—y) satisfies degree and positivity then f(x, — y) 6 

Proof. Let f (x, y) = Y. Q-ijxH)'. Since e~ ax e pP° s for positive a, we see that 



-<xY ' "" ax 



is in P, and hence f (x, —a) G P for all positive a. Since f (x, — y ) satisfies degree 
and positivity f (x, — y ) e P\ t \. □ 

15.12 Generating functions for maps P2 < — > P 



If T: P2 — ► P then the generating function of T lies in P3. We compute the 
generating functions for many of these maps. 

The most basic map of all is evaluation. The generating function of 

f(x,y) h-> f(z,<x) is 



2_ T ( x y J — — = 2_ z a 



-uN-vV 



x\)\ 

Next, consider the diagonal map f (x,y) 1— > f (z, z). The generating function 

is 



-uN-vV v- (JJ (-u)M-v)' 



— (u+v)z 



T(xy ] = 2_ z = e 

The map that extracts the coefficient of a fixed monomial also maps P2 
P. If T(x^ J ) = z 1 if j = k and otherwise then the generating function is 



-ur(-v) k (-v) k e _ uz 



Finally, we consider the map f(x,y) 1— > f(D, x). More precisely, this is the 



mapxhj' i-> -£rz> = (j) z? \ 



f-u) l (-v)J f (-zv)' X" /)\ /-u 



i<j J j=0 ; i=0 v 

)=0 



j! v z 



g — zv+uv 



We summarize these few results in Table [15741 

We can construct transformations P — ► P2 by homogenizing transforma- 
tions P — > P. 
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Name 








evaluation at a 


f(x,-y) h 


-> f (z, a) 


g-uz-av 


diagonal 


f(x,y) H 


f(z,z) 


e -z(u+v) 


coefficient 


f(x,y) h 


-> coef. of y k 


k! e 


derivative 


f(x,y) h 


-> f(-D,z) 


e -v(u.+z) 



Table 15.4: Generating Functions for P 2 — ► P 



Lemma 15.73. Suppose T:P — > P and T(x n ) e P pos (n). If Tlx 1 ) = f t (x) and 
Tntx 1 ) = fiU/u)^ f/zen T H : P — » P 2 . 

Proof. If g = 2^ nix 1 e P(n) then Tn(g) = X! <ufi(x/y)y' 1 ' The homogeneous 
part of T)-i(g) is f n (x/y)y n which has all positive terms since T(x n ) e pP os . 
Substituting a for y yields 

T H (g)(x,a) = ^atfitx/ocja 1 = Y (« T Qi)fj(x/cx.) = T(g(ax))(x/a) 

which is in P. □ 

If the generating function of T is G (x, y ) then the generating function of Th 

is G(x/y,yz). 

Note that evaluation at x = 1 shows that T: x l — > f[ ev maps P — ► P. 
This gives another proof of Corollary 19. 571 

Example 15.74. If T is the affine transformation x 1— > x + 1 then Ti-i(g) = 
g(x + y). For a more complicated example, if T(x x ) = Li(— x) is the Laguerre 
transformation then 

" \ / 



15.13 Linear transformations P pos — > P 

In this section we consider maps P pos — ► pp° s Tf we have a linear transfor- 
mation T: P alt — ► P Qlt then we can construct a map S: P pos — ► P pos by 
S(f)(x) = T(f(— x))(— x). The two generating functions satisfy 



f(x,y) = VT(x u )^ 
z — n! 



g( X/ y) = ^S(x n ) [ —^- = Jj-l)*T(x n )(-x)^ 



and so g(x,y) = f(-x,-y). 
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Theorem 15.75. Suppose T: p pos — ► P pos , T preserves degree and the sign of the 
leading coefficient. If 

CO , 

F(x,y) =T*(e- xv ) = £T(x l )J-^- (15.13.1) 

i=0 

then F(x,y] e P u . IfT: P alt — > P alt then F(x,-y) € P u . 

Proof. It suffices to assume that T: P pos — > P pos . Since e" x ^ e P 2 C we 
know that F(x,y) = l*(e- %v ) E S> u . □ 

Theorem 15.76. Suppose that F(x,y) = ^fi(x) — - — is in P^. If each ft is a 
polynomial and we define ~T{x x ) = f i/i! then T: p pos — ► p. 

Proof. We follow the proof for Pd, but since S\i is not closed under differenti- 
ation we need to multiply. This is why we have the factorial appearing. □ 

Since S\i is closed under multiplication, so is Look at Table [1531 We 
know that e~ ax G P2, and (1 + y) x £ &\ r \ since the latter is the generating 
function of a linear transformation P alt — ► P alt . Their product is in and 
is the generating function for the Charlier transformation. 

Table 115.51 lists the generating functions of some linear transformations 
P alt — > P alt . Table [15.131 lists the generating functions of linear transfor- 
mations P pos — > P whose image is not P pos . Notice that most of these gener- 
ating functions are not in P2 - indeed, they are not analytic for all real values 
of the parameters, but only positive y . 

Example 15.77. We can determine the closed formula for the generating func- 
tion F(x,y) of T: x 1 1— ► (— lj^x 1 given in Table H531 by considering four sums 
determined by the index mod 4. The result is 

00 f 
F(x,y) = J^C-ljCa x iL = V2cos(xy - tt/4). 

1=0 

The cosine has a representation as an infinite product 

™ / 4 X 2 \ 

so the generating function can be expressed as the infinite product 
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Name TJe"^) 

Binomial Pj - V ^ = iFi(-x,l,-y) 

i=o x " 

Charlier ^ C n = e~ ay (1 +y)* 



i=0 



Falling Factorial ^ = U-y) 



i=0 



i! 

,2i 



Hurwitz 5~ x 1 — — = coshf-v/xy 2 

^— (2i)! v 



i=0 



oo 4^ 

Hurwitz ^ 77^7 = 5 (cosfx 1 ^) + coshlx 1 ^)) 

i=o ^ >■ 

q-series ^ (-l)^x 1 = \/2cos(xu - 7t/4) 



i=0 



Rising Factorial ^ «*>i * l ) = e x(1 - £ ~ y; 



i=0 



Table 15.5: Generating functions for P alt — ► P alt 



Name TAe~ xy ) 

i! 



oo f 11 

Hermite H n V = e 2y ~* w 



i=0 

°° f-ul 1 
Hermite V x n H n = e 2 * 2 ^-^ 2 

^— i! 



i=0 



Laguerre £ L " V ^T" = ^MVu) 
i=o x - 



Table 15.6: Generating functions for P pos — ► P 
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The product is not in P 2 as the factorization shows 

/ _ 4(xy-f) 2 \ = / 2(xy_f) \ / 2(xy-f) \ 
^ (2k + l) 2 7tV ^ (2k+l)7tJ V (2k + l)7tJ 
= Fi(x,y)F 2 (x,y) 

T : i(x,y) G P2, but f"2(x, — y) G P2- This reflects the fact that the linear transfor- 
mation x l 1— > (— l)h>x x maps P pos to P, and does not map P — > P. 

Example 15.78. One way to find elements in is to compute the generating 
function of simple linear transformations. The generating function of U(x n ) = 
if n odd, and x n if n even is 

00 2n 

^ x2R (ln7 =COSh(Xy) 

Tt=0 * '' 

while the generating function of V(x n ) = if n even, and x n if n odd is 

00 2n+l 

X^ +1 ^TT=sinh(xy) 

T1=0 1 



Let T e (x n ) = if n is odd, and x n/2 if n is even. We know that T e : P pos — > 
P pos . Consequently, the map S(f) = T e (f)(l - x 2 ) maps P pos — > P. The 
generating function of T e is for positive x 



y 2n - 

2_ xn j2ny_ = cosh(y Vx) 



T1=0 



and the generating function of S is 



00 2n 

Y (1 - x 2 ) n ^-r = costyVl-x 2 ) 



n=0 



Finally, let T D (x) = if n even, and x ,n 1)/2 if n odd. The generating function 
of T Q for positive x is 

^ Xn f2n + lV = ^* smh ^ 

n=0 V 

15.14 More generating functions 

In this section we list in Table HBTMl a few generating functions for linear trans- 
formations that map p fl - nlte — > pJ. These functions are not entire - in order 
to converge we must assume that |x|, |y| < 1. 
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1*1*- i 


Chebyshev 


oo 

z 

i=0 


Ti 


i! 


= e x v C osh(Vy 2 (^ 2 -l)) 


Legendre 


oo 

L 


Pi 


i! 


= e^ Fi(;i;|(u 2 -i)x 2 ) 


Euler 


i=0 
oo 

L 


At 


i! 


e x y - xey 


Factorial 


oo 

L 


nlid-ia) 


i! 


= (1 +xy)- 1+1/x 




i=0 









Table 15.7: Miscellaneous generating functions 



Since the method of finding bivariate generating functions may not be fa- 
miliar, I'll describe how to find the generating function for the Euler transfor- 
mation. We compute the generating function of x n i— > A n by first computing 
the generating function of x n i— ► B n = x n A n ( 2 ^-). We use the recurrence for 

to derive a partial differential equation for the generating function, which 
we use the method of characteristics to solve. So recall (Lemma 17.601 1 that B n 
satisfies the recursion B n+1 = (x + l)(B n + xB^J. Substituting this into the 
generating function for T 

oo n 

F(x,y) = ^B n ^- 

T1=0 

oo n 

= ^(x + l)(B n +xB;) \ 

Tl=0 n ' 

with some manipulation yields the partial differential equation 

(x + l)F(x,y) = (-x 2 - x)F x (x,y) + T y (x,y) (15.14.1) 

The method of characteristics assumes that x, y are functions of s, t, where 
we consider t a variable, and s a parameter. We choose these functions to 
make each side of 1)15. 14.111 an exact differential. Thus, we have to solve the 
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three equations 

d 



-=-x(s,t) = -x 2 -x (15.14.2) 
at 

-Jy(s,t) = l (15.14.3) 
at 

-^-F(s,t) = (x + l)F(s,t) (15.14.4) 
at 

with the initial conditions 

x(s,0) = s 

y(s,0) =0 

F(s,0)=l 

We solve | |15.14.2| | and 1 115.14. 3b for x,y, and then use these two solutions to 
express 1 115.14. 4b entirely in terms of s, t. We solve that equation, solve for s, t 
in terms of x,y, substitute into our last solution, and we are done! Of course, 
a computer algebra system is indispensable for these calculations. 

e y 

The solution is F(x, y ) = . We then apply the first Mobius trans- 

1 + x — xe« 

formation z ^ 1/z which yields the generating function F(l/x, xy), and then 
z i— > z — 1 which gives F(l/(x — 1), (x — l)y) which simplifies to our result in 
the table. 
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Recurrences with polynomial 
coefficients 

How can we construct sequences of real-rooted polynomials {fil^g such that 
for all positive n the ft satisfy the recurrence 

frtMgoM + Wx)gi(x) + ■ • • + WMgaM = o (16.0.1) 

where the gi are polynomials in x? We say that the sequence {f,} satisfies a 
recursion with polynomial coefficients. We will show several ways of doing con- 
structing such sequences. Unfortunately, we don't know how to reverse the 
process. That is, given a sequence that satisfies a recursion with polynomial 
coefficients, prove that all the terms have all real roots. 

Example 16.1. Here are two simple examples. If ft = x 1 then we have the 
recurrence 

f n (x) -x-f n+1 (x) =0. 

A less trivial example is given by the Chebyshev polynomials of the 
second kind. They satisfy the recurrence 

U n (x) - U n+1 (x) • 2x + U n+2 = 

In each of these cases the consecutive terms interlace, but this will not be 
true in general. 

The Hermite polynomial H n satisfies the recursion 

H n+1 = 2xH n - 2nH n _i 

This is not a recursion with polynomial coefficients since the factor (2n) de- 
pends on the index. However, they do satisfy a differential recursion with 
polynomial coefficients 

H n = 2xH n _i — H i. 
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Every sequence that satisfies a recursion with polynomial coefficients also 
satisfies a determinantal identity. This has nothing to do with the property of 
having real roots or not. Given a sequence as in 1116.0. Il l we have the identity 



/ fn fn+1 
fn+1 fn+2 



90 91 



9d 



fn+d ^ 
fn+d+1 



\fn+d fn+d+1 ' 1 1 fn+2d 

If the Qi are not identically zero the determinant is zero: 



= 



fn fn+1 
fn+1 fn+2 

fn+d fn+d+1 



fn+d 
fn+d+1 



f 



n+2d 



16.1 The general construction 

Many of our recursions arise from repeatedly multiplying by a polynomial. 
Here's a trivial example: 

fo=f 

f n = 9 f n-1 

where f, g € P. More generally we turn polynomial multiplication into matrix 
multiplication, and the characteristic polynomial becomes the recursion. 
So, our construction requires 

1. A d by d matrix M that might have polynomial entries, or perhaps even 
linear transformations as entries. 

2. A vector v of polynomials. 

3. For i = 0, 1, . . . , all the entries of M l v have all real roots. 

If YJq a i^ x is the characteristic polynomial of M and n = 0, 1, . . . , then 

d 



i=0 



Thus if we write 



M l v = [Vi.i, ■ ■ ■ ,Vi,&f 
then each sequence of polynomials {pi,k}tLi has all real roots, and satisfies the 
same recursion 

d 

Y aip i+n , k = 0. 

t=0 

The vector v of polynomials can be 



CHAPTER 16. RECURRENCES WITH POLYNOMIAL COEFFICIENTS 520 



1. A collection of mutually interlacing polynomials. 

2. All coefficients of a polynomial in P 2 . 

3. The initial coefficients of a function in P 2 . 

16.2 Recursions from f(y) and g{x,y) 

In our first construction we always get the same recursion; this is due to the 
fact that our matrix is lower triangular. We start with the following data 



f(y) =X a ^ 

o 

g(x,y) = Y_ <3iWv x 
f(y)g(x,y) = ^H i (x)y i 

and we have the relation 



e P 

GP 2 











.. 0\ 




/g<A 




/ho\ 









.. 




9i 




hi 


a 2 


ai 


a 


.. 
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h 2 



\a d a d _i a d _ 2 ... a / \g d / 



\h d y 



Since f (y)g(x,y) e P 2 it follows that all the coefficients ho, . . . , h d have all 
real roots. We follow the outline in the previous section. The characteristic 
polynomial is (A — ao) d . If we write 



then all {pi,k} satisfy 

Pn,k - 



1 



J a p n -i,k + 



+ 



-l) d a d p n -d,k 







Note that the recursion is effectively independent of f if Qo ^ 0, since we 
can rescale to make ao = 1. Here's an example where d = 3: 



M 



f = (y + i) 3 

(1 0\ 

3 10 

3 3 10 

\1 3 3 1/ 



(x + 2y + l)(x + 3y +2) 
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v = 



fx 2 + 3x + 2\ 
5x + 7 
6 


„2 



V 



M 2 v = 



6x 2 
15x 2 



\20x 

The four sequences 



h3x + 2 
23x + 19 
h 75x + 78 
135x + 181 / 



Mv = 



M 3 v = 



-3x + 2 



-3x + 2 



x 2 + 3x + 2 



/ x 2 

3x 2 

3x 2 
V x 2 - 

/ : 
9x 2 



+ 3x + 2 \ 
h 14x + 13 
h 24x + 33 
■ 18x + 41 / 

2 + 3x + 2 
32x + 25 



\ 



36x 2 + 153x 
\84x 2 + 432x 



141 
-474/ 



-3x + 2 



5x + 7 3x 2 + 14x + 13 6x 2 + 23x + 19 9x 2 + 32x + 25 
6 3x 2 + 24x + 33 15x 2 + 75x+78 36x 2 + 153x + 141 

x 2 + 18x + 41 20x 2 + 135x + 181 84x 2 + 432x + 474 

have all real roots and satisfy the recurrence 

Pn - 3p n _i + 3p n _2 - Pn-3 = 0. 



16.3 Recursions from f (y, z) and g (x, y, z) 

Given a polynomial in P3 and v the vector of the first k coefficients of a poly- 
nomial in P2 we can construct a matrix that preserves such initial sequences. 
This gives us recursions of real rooted polynomials. We give the general con- 
struction, and then give some examples. 

Construction 16.2. We are given f = ^fi,j(x)y 1 z' in P3, and g(x,y) = 
QiMy 1 is the first r + 1 terms of a function in P2. . 
We define 

vo = (go,...,g r ) 
By Theorem l24.11l we know that all polynomials 
M k v = v k = (v ,k, ■ • ./V T ,ic) 
have all real roots. Suppose that M satisfies 



Then the sequences 



Y_ ociM 1 = 



90 — v 0,0/ v 0,l' v 0,2/ • • • 

gi = vi,o,vi4,v 1/2 ,... 



gr — V T ,0,V T ,l,V Tt 2, ■ 
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all satisfy the same recurrence 



i=0 



Example 16.3. We start with 



x + 1 1 1 

1 V 1 
1 1 z 



= yz(l + x) - y - z + (1 - x) G P 3 



The matrix 



M 



-1 1-x 
1+x -1 



preserves initial sequences of length two. M satisfies its characteristic polyno- 
mial 

A 2 + 2A + x 2 = 
If goM +V 9iM G P2 then the polynomial sequences 



-1 1-x 
1+x -1 



are in P and satisfy 



g n + 2 g n _j + x 2 g n _ 2 = 
f n + 2f n _ 1 +x 2 f n _ 2 =0 



Now we choose go = —1 and gj = x. Since go + ygi = xy — 1 e P 2 we can 
apply the above construction 



v = 



Mv 



-X 2 + X + 1 

-2x-l 



3x 2 - 2x - 2 
-x 3 + 4x + 2 



giving us the two sequences of polynomials 



-1 -x 2 



1 3x 2 - 2x - 2 x 4 - x 3 - 7x 2 + 4x + 4, 



f : x -2x - 1 -x 3 + 4x + 2 4x 3 + x 2 - 8x - 4, 

Consecutive g/s or ft's do not necessarily interlace, but they all are in P. How- 
ever, gt and f\ do interlace. 

Example 16.4. The polynomial xy +xz+yz— 1 is in P 3 since ^101^ is negative 
subdefinite. For a particular choice of v we have nice formulas for the two 
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polynomial sequences. 

*-(; *) 

CP(M) = y 2 - 2xy + x 2 + 1 

Pn = 2xp n -l - (X 2 + l)p n -2 

v = (x,l) 
M k v = (f k ,g k ) 

We know that f k and gk have all real roots since they are derived from the 
even and odd parts of (x — l) n 

Example 16.5. In this example we construct a sequence of polynomials in two 
variables that satisfy a three term recursion. We begin with the Grace polyno- 
mial that is in P 4 

Grace 2 = 2x a x 2 + (x x +x 2 )(y a +y 2 ) + 2yiy 2 

Assume that f = fo(xi,x 2 ) + fi(xi,x 2 )yi e P3. Consider the coefficients of 
yi in f • Grace 2 



Coefficient of y 1 : [2xix 2 + y 2 (xi + x 2 )]fo 

Coefficient of y\ : [2xix 2 + y 2 (xj + x 2 )]fj + [2y 2 + xj + x 2 ]f 

Coefficient of y\ : [2y 2 + xi + x 2 ]fi 



The coefficient of y\ is a polynomial in y 2/ so the coefficients of y\ and y\ 
interlace: 



Coefficient of y 2 : (xi + x 2 )f + 2xix 2 fi 
Coefficient of y\ : 2f + (xi + x 2 )fj 

It follows that the matrix 



M = 



xi + x 2 2xix 2 
2 xi + x 2 



preserves interlacing pairs of polynomials in two variables. As expected, the 
determinant, (xi — x 2 ) 2 , is non-negative. 
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The sequence of vectors 

J:)- m (;:)' m2 0: 

consists of polynomials in P2. Write M 1 = (j^j ■ The characteristic 

polynomial of M is 

z 2 - 2(xi + x 2 )z + (xi - x 2 ) 2 
and since M satisfies its characteristic polynomial we have 

M n+2 • v - 2(xi + x 2 )M n+1 ■ v + (xi - x 2 ) 2 M n ■ v = 
for any vector v. Therefore, we have the recurrences 

9n+2 - 2(xi + x 2 )g n +i + (xi - x 2 ) 2 g n = 
h n+2 - 2(xi + x 2 )h n+ i + (xi - x 2 ) 2 h n = 

If we want to construct examples then we must begin with g < — ho, and 
not with g 0; gi. For example, if we take a degenerate case go = 1, ho = then 
the first few pairs are 

l\ fx\+ x 2 \ (x\ + 6xix 2 + x 2 \ (x\ + 15x 2 x 2 + 15xix 2 + x\ 
OJ' \ 2 ) ' \ 4xi + 4x 2 )' V 6x i + 20x i X2 + 6x^ 

It's not hard to see that we have an explicit formula 

'2nS 



gn(xi,x 2 ) = Y_ 



i=0 



2i 



H A 2 



and the gt satisfy the recurrence above with initial conditions go = 1 and 

gi=xi+ x 2 . 

16.4 Recursions from f (x, — D y ) and g (x, y ) 

We know that if f(x,y) e P2 then f(x,— D y ) maps P2 — ► P 2 . This guarantees 
that our recursions will consist of polynomials with all real roots. 
We assume 

f(x 7 y) = ^f i (x)y i eP 2 

i 

f(0,0)^0 

d 

g[x,y) ^^g i {x)y i e P 2 

i=0 

The action of f (x, — D v ) never decreases u-degree, so it determines a d + 1 by 
d + 1 matrix acting on the coefficients of polynomials in P2(d). 
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Example 16.6. We assume 

f(x,y) = (x + y) 3 g(x,y) = g (x) + gi(x)y + g 2 (x)y 2 

The action of f (x, — D v ) on g is given by the matrix 

x 3 — 3x 2 6x \ 
x 3 -6x 2 
x 3 / 

This gives recurrences of the form 

Pn - 3x 3 p n _! + 3x 6 p n _ 2 - X 9 p n _3 = 

Example 16.7. In this case we let f = e~* v , and g = gt(x)y\ The matrix is 

/l — X X 2 —x 3 X 4 \ 

1 -2x 3x 2 -4x 3 

1 -3x 6x 2 

1 -4x 

\0 1 / 

The characteristic polynomial is (A — l) 5 , so these recurrences are of the form 

Pn - 5p n _i + 10p n _2 - 10pn-3 + 5p n -4 - Pn-5 = 0. 

16.5 Recurrences from f(— D x ,y) and g{x,y) 

If we use f(— D x ,y) instead of f(x,— D v ) then we get recurrences involving 
derivatives. We assume 

f(x,y) ^^f^xjy 1 GP 2 

i 
d 

g(x,y) =^g i (x)y i e p 2 

1=0 

We give a very simple example - note that the degree of the recurrence comes 
from the number of coefficients we consider, and not from the degree of f . 

Example 16.8. We let 



f(x,y)=y-D x g = ^gi(x)y\ 

We consider only the first four coefficients of g, so the action of y — D x is 

-9o +y(go - g[) +v 2 (m - g 2 ) +y 3 (92 - g^)- 
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As a matrix this is 



/- 


-D x 








° \ 


/go\ 




1 


-D x 








gi 







1 


-D x 





92 


V 








1 


-dJ 


\93/ 



The characteristic polynomial is (A + D x ) 4 , so the recurrence is 

Pn + 4 P ;_! + 6p™ 2 + 4p^l 3 + p^ 4 = 0. 



16.6 Recurrences from mutually interlacing polynomials 

If a matrix preserves mutually interlacing polynomials then we get recursive 
sequences of polynomials in P. 

Example 16.9. The matrix and initial vector 

/l x + 1 x + l\ /x-1 
M= 1 1 x + 1 v= x 

\1 1 1 / V x + 1 

determine three polynomial sequences M 1 v = (fi, gi, hd) that satisfy the re- 
currence 

Pn = 3p n _! +3xp n _ 2 + X 2 p n _ 3 . 

To find the initial terms we need to compute Mv and M 2 v. 

/2x 2 + 4x\ /x 3 + 10x 2 + llx\ 

Mv = x 2 + 4x M 2 v = 6x 2 + llx 

\ 3x / \ 3x 2 + llx / 

It follows that the following sequence of polynomials has all real roots. It also 
appears that ft < fi_i but that doesn't follow from the general construction, 
and I don't know how to prove it. 

ho =x + l 

hi = 3x 

h 2 = 3x 2 + llx 

h n = 3h n _! + 3xhn_2 + x 2 h n _ 3 

Example 16.10. In this example we start with the recurrence, and find a matrix 
that preserves mutually interlacing polynomials. Consider the sequence 

qo = qi = q2 = • • • = qa = i 

qk = qk-i + xq k _ d for k> d 
qa+i=l + x, q d+2 = l+2x, ••• q 2 d+i = 1 + (d + l)x 
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Rewrite the recurrence in matrix terms: 

(I ... x\ /q k . 



\0 



qk-i 



\q 



k-d+l 



The d+1 by d+1 matrix preserves mutually interlacing polynomials. Since 
the vector (qa+i/ • • ■ / q2d+i) is mutually interlacing, we can repeatedly apply 
the Lemma to conclude that (q k/ . . . , qk+d) is a vector of mutually interlacing 
vectors. In particular, all q n have all real roots. 

If a two by two matrix preserves interlacing polynomials then we get a 
recurrence. This is really just a special case of a general matrix preserving 
the first k coefficients of a polynomial, and also a special case of a matrix 
preserving mutual interlacing. 



Example 16.11. The matrix I ^ ^ I preserves interlacing. If we let 



1 



then all p n G P. The characteristic polynomial is A 2 — A 
recurrence 

Pn = Pn-1 + *Pn-2 



x, so we have the 



16.7 Alternative approaches 

There are many specialized ways to prove facts about recurrences. 1541 has 
some very interesting results about recurrences with polynomial coefficients. 

Of course, there is a 1-1 correspondence between recurrences with polyno- 
mial coefficients and rational functions. Here we start with a rational function. 

If we write 

1 ^~ H-n.iM 



then we will show that the roots of the coefficients h n j lie on n equally spaced 
rays through the origin. For example, if n = 2 we have 

1 1 2x 3x 2 + l , 4(x 3 + x) , 

tv + t. — + -7 — +■■■ 



l-(x + y) 2 1-x 2 (1-x 2 ) 2 (1-x 2 ) 3 (1-x 2 ) 4 

and all the roots lie on the imaginary axis. There is a simple expression for 
these coefficients 



2jx + yr •£ 



x 
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The coefficient of y > is x ' g (x n ) where 

The series converges for |x| < 1. If x is positive then the series is positive or 
divergent. Thus, g n ,j has no positive roots. Here are some values of the sum 





n = l 


n = 2 


n = 3 




l 

l-x 


l 

l-x 


l 

l-x 


j = 1 




2x 


3x 


(l-x) 2 


(l-x) 2 


(l-x) 2 


i =2 




3x 2 +x 


6x 2 +3x 


(l-x)* 


(l-x) 3 


(l-x) 3 



We will show that f n j e P, which establishes the claim. We write g n j (x) 
in terms of a differential operator. First, let ( n *) = Y. aicX k . 



9n,j(x) = Y_ ( ■ ) xl = Y. 

i ^ - 1 ' i,k 

= £ a k (xD)V = QjxDjV)^ = 

W i 

It simplifies the argument if we eliminate the denominator, so we will 
show that Fj e P where 

(uxD)(nxD - 1) • • • (nxD - j + 1)^1- = 

This yields the recurrence 

F n ,j+i = [((j + l)n + j)x + j]Fn,, -x(x-l)F;j 
This recurrence is of the form 

p n = (ax + b)p n _! - x(x - l)Vn-l 

Since x(x — 1) is positive at the roots of p n it follows that all fj have all real 
roots and all positive coefficients. 



/nxD\ _J_ 



Chapter 



Matrices 



The theme of this chapter is that there are several ways to represent sequences 
of interlacing polynomials by Hermitian matrices. There is a 1-1 correspon- 
dence between Hermitian matrices and interlacing sequences of monic poly- 
nomials. Since orthogonal polynomials can be realized as the characteristic 
polynomials of certain Hermitian matrices, many results about orthogonal 
polynomials can be generalized to Hermitian matrices. 

17.1 Basics 

We first recall some basic facts about matrices. A* denotes the conjugate trans- 
pose of A. The matrix A is Hermitian if A = A* . Now we are only considering 
real polynomials, so we could restrict ourselves to symmetric matrices, but it 
is no harder to consider Hermitian matrices. A matrix U is unitary if UU* = I, 
where I is the identity matrix. A Hermitian matrix has all real eigenvalues. 
The spectral theorem says that there is a unitary matrix U and a diagonal ma- 
trix A consisting of the eigenvalues of A such that A = UAU* . If all entries of 
A are real, then U can be chosen to be real, so that it is an orthogonal matrix. 

The characteristic polynomial of A is xl — A|, and is written CP (A). The 
set of roots of the characteristic polynomial of A is written A (A). We say that 
A<Bif CP(A)< CP(B), A<Biff CP(A) < CP(B), and so on. A Hermitian 
matrix is positive definite if all of its eigenvalues are positive, and is positive 
semidefinite if the eigenvalues are all non-negative. 

If the matrix A is invertible then interlacing properties of A determine in- 
terlacing properties of A -1 . 

• If A is invertible and A (A) = (qi, . . . , a n ), then A(A _1 ) 

• If A is positive definite and A < B then B _1 < A -1 . 

• If A is positive definite and A < B then A -1 < B _1 . 



1_ 

ai 
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• The characteristic polynomial of A 1 is 

|xl - A" 1 ! = | - A" 1 ! |I — xA| = | — A" 1 ! |(l/x)I - A|x n 

where A is n by n. Since the reverse of a polynomial f is f REV = x n f (1/x), 
we can write 

CP(A" 1 ) = — L-fcPtA))^ 
17.2 Principle submatrices 

In this section we establish the important result that Hermitian matrices cor- 
respond to sequences of interlacing polynomials. 

Definition 17.1. For n by n matrix M , and index sets a, |3 c {1, . . . ,n}, the 
(sub)matrix that lies in the rows indexed by a and columns indexed by |3 is 
denoted M{oc, (3}. If a = (3 then we write M{a}; such a submatrix is called a 
principle submatrix (§ 110110.141 . Sometimes we want to refer to a submatrix via 
the deletion of rows; in this case we let M[ct] be the submatrix resulting from 
the deletion of the rows and columns not listed in a. The size |a| of an index 
set a is its cardinality. 

We know that if A is a principle submatrix of B then B < A. 

Suppose that f is a polynomial with roots roots(f) = (m,. . ., a n ). Set 
A equal to the diagonal matrix whose elements are (a\, . . . ,a n ), and let 
y* — (yi, . . . ,y n ). The following calculation shows the relation between the 
characteristic polynomial of A and an extension of A. Suppose that 



A = 



(a : v \ 
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The characteristic polynomial of A is 

fxl-A : tj ^ 



det(xl - A) = det 



det 



V -V* '■ x-a/ 



Vttxi-A)-^]* ; 1/ 
i ; ui-A)- 1 ^ 



xl — A 



V -y* 



x — a/ 



U- 



det 



A 







V 



: (x-a)-y*(xI-A)-V 
= [(x-a) -y*(xI-A) _1 y] det(xl — A) 

= x - a f x - > yf 

* — x — a; 

i=l 

If Pi is the characteristic polynomial of A{1, . . . , i}, then from repeated ap- 
plications of Theorem |1.61| we have a sequence of interlacing polynomials 

Vn<Vi-l< ■ ■ ■ <Vo 

Conversely, we can use the above calculation to show that given the pi's, we 
can find a Hermitian matrix determining them. 

Theorem 17.2. Suppose that po,pi,. ■ .,p n is a sequence of polynomials such that 
the degree ofpi is i, and Pi < Pi-i/or 1 < i < n. There is a Hermitian matrix A such 
that the characteristic polynomial of A{1, . . is a multiple of pi, for ^ i < n. 

Proof. We prove this by induction on n. The case n = 1 is clear, so assume 
that A is an n by n Hermitian matrix such that the characteristic polynomial 
of A{1, . . . , i} is a multiple of pi, for ^ i ^ n. Choose an orthogonal matrix 
U so that UAU* is the diagonal matrix A whose diagonal consists of the roots 
of Pn. We need to find a number b and vector z* = [zi,...,z rL ] so that the 
characteristic polynomial of 



A 



(17.2.1) 
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is equal to a multiple of p n +i- Write 



B = 



(u 


°) 


(a 






: o\ 


(a 


y\ 


\o 


V 


v- 


V 


u 


: 1/ 


\V 


a/ 



(17.2.2) 



The characteristic polynomial of A is cp n for some constant c. The charac- 
teristic polynomial of 1117.2. lb is the same as the characteristic polynomial B in 
l|17.2.2l l. Since p n +i <p U ; we can apply Lemma [1.20| and find an a and y? such 
that 

V" 2 PnW 



Pn+l = 

where roots(p n ) = (m, . . ., a 



a)p T 

). The desired matrix is U*BU. 



□ 



If Pi+i < pi for all i then all of the coefficients y? are non-zero. In this case 
there are 2 n choices of yi's, and so there are 2 n different A's. 

Example 17.3. The construction of a matrix from a sequence of characteristic 
polynomials involves solving many equations. Here are a few examples of n 
by n matrices M where we are given a polynomial f and the determinant of 
the first i rows and columns is a constant multiple of f . 



1. If f = fx- a)[x-b) then M = 



2. Iff = (x-l)(x-2)(x-3)thenM= ^= 



3. Iff 



1) then M 




Example 17.4. If a Hermitian matrix H is partitioned into diagonal blocks, 
and the polynomials determined by each of these blocks strictly interlace, we 
do not have strict interlacing for the polynomials determined by H. Assume 
that A is an r by r Hermitian matrix determining the polynomial sequence 
p r < • ■ ■ < Pi, and B is an s by s Hermitian matrix determining the polynomial 
sequence q s < • • • < qi. The partitioned matrix 



A 







\0 : Bj 
determines the polynomial sequence 

Pl >p 2 > • • Pr>Pr qi>Pr <l2> ■ ■ ■ >Pr q S 
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If a Hermitian matrix M is diagonal with diagonal entries {a\, . . . ,a n ), 
then the characteristic polynomial is f(x) = [x — a\) ■■■{%— a n ). The charac- 
teristic polynomial of a single row/column deletion M[i] equals f(x)/(x— ai). 
Since these are the polynomials occurring in quantitative sign interlacing it is 
not surprising that the characteristic polynomials of single deletions are also 
important. For instance, an immediate consequence of Theorem 11.611 is that 
for any choice of positive the polynomials 



^ tH CP(M[i]) 



i=l 



have all real roots. 

Analogous to the formula 



we have 



dx f — x — at 

i=l 



— CP(M) = Y CP(M[i]) 
dx z — 

i=i 

This can be established by considering the derivative of only the terms in 
the expansion of the determinant that have diagonal entries in them. 

Example 17.5. It is not the case that the characteristic polynomials of the single 
deletion submatrices of an n by n matrix span an n-dimensional space. The 
matrix M = ( J J ) has eigenvalues 1, —1 but M[l] = M[2] = (0), so they have 
the same characteristic polynomial, x. More generally, for any symmetric C 
with distinct eigenvalues none of which are 1 or —1, the matrix M below has 
M[l] = M[2], so the characteristic polynomials do not span an n-dimensional 
space. 

/0 1 0\ 

10 
\0 C/ 

An n by n matrix M has 2 n principle submatrices. Their characteristic 
polynomials can be combined to form polynomials that are in P n +i- 

Lemma 17.6. Let M be an n by n symmetric matrix. For each a c {1, . . .,n} let 
x a = Yliea x i-- Then the polynomials below are in P n+ i. 

Y_ CP(M{a})x a 

«C{l,..,ti} 

Y_ detM{a}x a 

£tC(l,..,u) 
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Proof. The polynomial f (x,x) in question is nothing other than the character- 
istic polynomial of M + D where D is the diagonal matrix whose diagonal is 
(xi, . . .,x n ). It remains to see that f G P n +i- Consider g e = CP(M + D + ezl) 
where z = xi + • • • + x n . The homogeneous part of g e is YIi( x + xi + e(xi + 
• • • + x-n) and for e > this is in P n +i- If we substitute values for x\,...,x n 
then the resulting polynomial is the characteristic polynomial of a symmetric 
matrix and so is in P. Since lim e ^o+ 9e = f it follows that f is in the closure of 



The second assertion follows from the first by setting x = 0. 



□ 



For example, if M 



q b 

.be, 



then the two polynomials are 



det 



a— x — xi b 

b C — X — X2 



det 



a— X! b 
b c— x 



= (x — a)(x — c) — b 2 + (x — a)x2 + (x — c)xi + X1X2 
= (ac — b 2 ) — ax2 — cxi + X1X2 



17.3 The Schur complement 

The Schur complement of a matrix plays a role that is similar to contraction 
for a graph. Although the initial definition is unintuitive, there is a simple 
interpretation in terms of deletion in the inverse that is easy to understand. 

Definition 17.7. For an n by n matrix M , and index sets a, p c {1, . . .,n} 
where a and |3 have the same size, the Schur complement of the (sub)matrix 
that lies in the rows indexed by a and columns indexed by (3 is 

A/ [a, |3] = A{a', |3'} - A{a', (3}A{cc, pr : A{a, (3'} 

If a = (3 then we write M/[a]. Notice that if a has size k, then the Schur 
complement has dimension n — k. 

Example 17.8. If M is diagonal with diagonal entries [a\, . . ., a n ), and a c 
{!,..., n}, then the Schur complement M/[a] is the diagonal matrix M{oc'}. 

The Schur complement may be computed by taking the inverse, deleting 
appropriate rows and columns, and taking the inverse of the resulting matrix. 

Theorem 17.9 ([5]). If A is invertible, then A/ [a, (3] = (A _1 {P', a 7 })" 1 

An interesting property is that two disjoint deletions commute. 

Theorem 17.10 (O). If cti, ocj, Pi, P2 are index sets with \a.\\ — |(3i|, I0C2 = IP2I 
and oi\ n ot2 = |3i n P2 = then 

(M/tai, PJ) /k 2 , PJ = (M/[a 2 , PJ) /[ai, Pi] = M/[ai U a 2 , Pi U p 2 ] 
We can now show that Schur complements preserve interlacing. 
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Theorem 17.11 ( 111601 ). If A is a definite (positive or negative) Hermitian matrix, 
and if\a\ — |6 = 1, then A<A/[a, (3], 

Proof. Since A -1 is Hermitian and its principle submatrices interlace, we get 
A -1 < A _1 {(3', a'}. Since all eigenvalues of A have the same sign, when we 
take inverses interlacing is preserved: 

A= (A- 1 )" 1 < (A- 1 {P',o , })~ 1 = A/[a,(3] 

□ 

Corollary 17.12. If M zs non-singular definite Hermitian, and we set p^ = 

CP(M/[{l..k}]) then 

Pn <Pk-l < ■ ■ • <pi 

Proof. Since ( M/[{l..k }])/[{k + 1}] = M/[{l..k + 1}] by TheoremUZlOl we can 
apply Theorem ll7.11l □ 

Another simple consequence is that we can realize polynomial sequences 
by characteristic polynomials of Schur complements. 

Theorem 17.13. Suppose that po,Pi, ■ • ■ ,p n is a sequence of polynomials such that 
the degree o/pi. is i, and pi <_Pi-i for 1 < i < n. There is a Hermitian matrix A 
such that the characteristic polynomial o/A/[{l, . . .,i}] is a constant multiple o/pt, 
for ^ i ^ n. 

Proof. Let B be a Hermitian matrix such that CP(B{{l..k}}) = (pO REV . B" 1 is 
the desired matrix. 

□ 

17.4 Families of matrices 

We consider properties of the eigenvalues of continuous families of Hermitian 
matrices. We begin with a definition of a locally interlacing family of matrices. 

Definition 17.14. If {H t } is a continuous family of Hermitian matrices such 
that the family of their characteristic polynomials is a locally interlacing fam- 
ily, then we say that {H t } is a locally interlacing family . 

If the entries of a Hermitian matrix are functions of a parameter, then we 
can determine the derivatives of the eigenvalues with respect to this parame- 
ter. This result is known as the Hellmann-Feynman theorem: 

Theorem 17.15. Let H(x) be an n by n Hermitian matrix whose entries have 
continuous first derivatives with respect to a parameter x for x e (a, b). Let 
A(H(x)) = (Ai(x),. . .,A n (x)) be the eigenvalues of 'H(x), with corresponding eigen- 
vectors ui, . . . ,u n . If H' is the matrix formed by the derivatives of the entries of H(x), 
then for j = 1, . . .,n 

dAj _ u* H' Uj 
dx u^Ui 



CHAPTER 17. MATRICES 



536 



Proof. More generally, we consider an inner product for which (H t u,v) = 
(u, H t v), (u,v) = (v,u), and (v,v) > if v 0. From the fact that the ut 
are eigenvectors, we find that for any s ^t 

(H t Ui(t),Ui(s)) = Ai(t) (uM^ds)) 
(H s Ui(t),Ui(s)) = (H s Ui(s),Ui(t)) 

= Ai(s) (ut(t),ut(s)) 

Subtracting and dividing by t — s gives the difference quotient 
Hl ~ Hs ut(t),Ui(s)) = ^i (ui (t),ut(s)) 



t — s / t — s 

The result follows by taking the limit as s goes to t. □ 

Corollary 17.16. If H(x) is an nby n Hermitian matrix whose entries have con- 
tinuous first derivatives with respect to a parameter xfor x 6 (a,b), and H'(x) is 
positive definite for all x, then {H t } is a locally interlacing family. 

Example 17.17. Suppose that H is Hermitian. Consider the family {e tH }. The 
derivative of this family is 

— (e tH ) = He tH 
dt v ' 

If v is an eigenvector of H with eigenvalue a, then He tH v = H(e at v) = ae at v 
and so v is an eigenvector of H' with eigenvalue ae at . These eigenvalues 
are increasing if and only if a is positive, so we conclude that {e tH } is locally 
interlacing if and only if H is positive definite. 

Corollary 17.18. If {H t } is a family of Hermitian matrices such that ^-H t is strictly 
totally positive, then the families of characteristic polynomials of any principle sub- 
matrix are locally interlacing. 

Proof. Since every principle submatrix of H' is positive definite, the result fol- 
lows from Corollary |17.16l □ 

The next result is known as the monotonicity theorem, since it states that 
adding a positive definite matrix increases all the eigenvalues. 

Corollary 17.19. If A, B are Hermitian matrices, and B is positive definite, then the 
family {A + tB} is locally interlacing. 

Proof. The derivative ^(A + tB) is simply B. □ 

Example 17.20. Consider the family {H + tl}, where H is Hermitian. Since 
I is positive definite, it is an locally interlacing family. If the characteristic 
polynomial of H is f (x), then 

||xl - (H + tl)| = |(x- t)I - H| = f (x - t) 

Another elementary family is given by {tH}. Since |xl — tH| = f (x/t) it is 
locally interlacing. 



CHAPTER 17. MATRICES 



537 



We can relate interlacing of polynomial sequences with their Hermitian 
matrices, but first we need to recall the positive definite ordering (p. 14581 1 of 
Hermitian matrices. 

Lemma 17.21. Suppose that A, B are Hermitian. They determine polynomial se- 
quences pi = ||xl — A{l..i}| and — ||xl — B{l..i}|. If B -< A then pi -< q\for 
1 < i < n. 

Proof. Consider the family M t = tA+ (1— t)B. The derivative is A— B which is 
positive definite, so it is a locally interlacing family Since Mo = B and Mi = A 
it follows that the roots of M t are increasing as a function of t, so pi -< qi. 

□ 



17.5 Permanents 

If A = (ay) is a matrix then the permanent of A, written per (A), is the deter- 
minant without alternating signs: 

n. 

per(A) = Y_ EI ai '°-W 

crGS^ 1=1 

Unlike the determinant, polynomials such as per(xI+C) where C is symmetric 
almost never have all real roots. If we define J to be the all 1 matrix, then there 
is a beautiful conjecture Il76l 

Conjecture 17.22. If A = (ay) zs a real nby n matrix with non-negative entries 
which are weakly increasing down columns then the permanent ofA + x] has all real 
roots. 

The permanents of principle submatrices of A + xj do not interlace the 
permanents of A + xj, but we do have common interlacing: 

Lemma 17.23. Suppose that A satisfies the conditions above, and the conjecture is 
true. Then per(A + xj) and per( (A + xj) [1] ) have a common interlacing. 



Proof. If we let M = A + xj so that M = M where 

M a = 



an + x + a 
w + x 



v + x 
A[l]+xJ[l] 



By linearity of the permanent we have 



per(M a ) =per 



an + x v + x 
w + x A[l]+xJ[l] 



aper(A[l]+xJ[l]) 



If we choose a to be negative then the conjecture implies that per(M a ) has 
all real roots. Consequently, per(M) + a per(M[l]) has all real roots for all 
negative a. This implies that per(M) and per(M[l]) have a common interlac- 
ing. □ 
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Remark 17.24. The proof can be easily modified to show that if we remove 
the first row and any column from A then the permanents have a common 
interlacing. Empirically it appears that all the submatrices have a common 
interlacing. 



17.6 Matrices from polynomials 

If A is a constant matrix and f (x) is a polynomial then f (xA) is a matrix whose 
entries are polynomials in x. We look for conditions on A or f that imply that 
all entries of f (xA) are in P. 

A simple case is when A is a diagonal matrix diag(ai, . . . , a n ). The result- 
ing matrix f (xA) is a diagonal matrix. To check this, write f (x) = Y. biX 1 : 



f(xA) =^b t x t diag(ai,... / ai x )) 

= diag b^xc^) 1 , . . . b^xaj 1 ) 

= diag(f(aix),...,f(a n x)) 

If A is diagonalizable then we can write A = SAS -1 where 
A = diag(di, ...,d n ). The entries of f(xA) are linear combinations of 
f (dix), . . . , f (d n x) since 



f (xA) = Y_ biX l SA l S _1 

= S (Y_ b t x l A l ) S- 1 

= S diag(f(dix), . . .,f(d n x)) S _1 

For example, if A = ( J * ) then A is diagonalizable, and A n = ( J ) 
so that 



TlXAJ "I f(Qx) 



f(x)) 



It is easy to find polynomials f for which the entries of the above matrix 
are not in P. 

A different kind of example is given by A = ( { J ) which is not diagonal- 
izable, yet f (xA) has an especially simple form. Since A n = ( ^ J ) it follows 
that 

f(xA) = 5>x l (lS) = ( x $,?) 

This example can be generalized to the matrices whose entries above the 
diagonal are 0, and the rest are 1. For instance, when n = 4 



(I 










n ( 


1 


l 










1 


l 


1 







V 


l 


1 


V 





1 

(?) 1 

( n 2 +1 ) (?) 



0\ 



1 
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It follows easily that 

f(xA) -- 



( f 

xD(f) f 

xD 2 (xf) xD(f) f 

\xD 3 (x 2 f) xD 2 (xf) xD(f) ij 



Thus for these matrices f (xA) has all its entries in P for any choice of f . It 
is easy to characterize those polynomials for which f (xA) is a matrix whose 
entries are all in P for every choice of A. 

Lemma 17.25. If f is a polynomial then the following are equivalent: 

1. f (xA) has all entries in P for every matrix A. 

2. f(x) = cx n (ax + b)for constants a,b,c. 
Proof. If (2) holds then 

f (xA) = cx n (axA n+1 + bA n ). 

Consequently, all entries of f (xA) are of the form x n times a linear term, and 
so all entries are in P. 

Conversely, assume f (xA) has all entries in P for any choice of A. Consider 
the matrix A 



_°jq) and its powers: 



(°iS) 4k = (SS) 

If f (x) = YJq a i xl then 
t(xA) - ^_ ^ x4i+1 _ Q4 . +3 x4i+3) 

_ / f e (-X 2 ) Xf (-X 2 )\ 
" ^-Xf (-X 2 ) f e (-X 2 ) ) 

where f e and f are the even and odd parts of f . If we consider B = ( J J 
a similar computation shows that 



/ o jN4k+l _ / o n 
l-ioj - I -10 J 

/ o l\4k+3 _ / _n 

I -io J -hoi 



LtOli+l X 4i + 1 -04i + 3X«+ 3N 

2^ t a 4 i x 4t - a 4i+2 x 4l+2 y 



(So) then 



f(XB) - 1,-Xf (x 2 ) f e (x 2 ) 



Now f e (x 2 ) e P iff f e G P Qlt , and f e (-x 2 ) e P iff f e e P pos , and hence 
f e (x) = ax r for some a, r. Similarly f (x) = bx s for some b, s, and thus f (x) = 
ax 2r + bx 2s+1 . If we choose A = (1) we see that f e P, and hence 2r and 2s + 1 
must be consecutive integers. This concludes the proof of the lemma. □ 



Chapter 

18 



Matrix Polynomials 



A matrix polynomial is a polynomial whose coefficients are matrices. Equiva- 
lently, it is a matrix whose entries are polynomials. We assume all matrices 
are \x by [i. We write 

f (x) = A + ■ • • + A n x n 

The leading coefficient of f (x) is the matrix A n . We are interested in the class 
of hyperbolic matrix polynomials that are a nice generalization of P. 

Definition 18.1. The hyperbolic matrix polynomials in d variables of degree n, de- 
noted Hyp d (n), consists of all matrix polynomials 

f (x) = Y. A i xI sucn mat 

1. < f • v, v >= v t f(x)v G Pd for all v e R^. 

2. f (x) has degree n. 

3. If |I| = n then Ai is positive definite. 

Hyp d os (n) is the subset of Hyp d (n) where all coefficients are positive defi- 
nite. If we don't wish to specify the degree we write Hyp d and Hyp d os . 

A hyperbolic matrix polynomial of degree zero is a constant matrix, and 
must be positive definite. 

If g e Pd/ and P is a positive definite matrix then gP € Hyp d . Although 
this is a trivial construction, the embedding P d — ► Hyp d is important. 

18.1 Introduction 

In this section we establish a few simple properties about hyperbolic matrix 
polynomials. The role of positive coefficients in P is played by positive def- 
inite polynomials in Hyp d . For instance, a matrix polynomial of degree 1 in 
Hyp d has the form 

AiXi H + A d x d + S 
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where the are positive definite, and S is symmetric. If S is positive definite 
then the polynomial is in Hyp^ os . 

The analog of multiplying by a positive constant is replacing f by A*f A. 
The leading coefficients are positive definite, and if w = Av then 

< A*fA ■ v,v >= v t A t fAv =< f • w,w >e P d - 

In the case that d is 1 then we can multiply on the left and right by the 
square root of the inverse of the leading coefficient, and so we may assume 
that the coefficient of x n is the identity matrix. 

Lemma 18.2. Any principle submatrix of a matrix in Hyp d is also in Hyp d . In 
particular, all diagonal entries are in P d . 

Proof. This is immediate, for we just need to set the i-th coordinate of v equal 
to zero to show the hyperbolic property for the i-th principle submatrix. The 
leading coefficients are positive definite since all principle submatrices of a 
positive definite matrix are positive definite. □ 

Lemma 18.3. Suppose that f(x) is a diagonal matrix diag(fi(x), . . .,f^(x)). The 
following are equivalent: 

1. f G Hyp v 

2. fi, . . . , f p. have a common interlacing. 
Proof. Ifn. = 2then 

(a,b) (* f ° 2 ) (a,b) t = Q 2 fj + b 2 f 2 g P 

Thus all positive linear combinations of fi and f 2 lie in P, and so fj and f 2 
have a common interlacing. It follows that any two of the fi have a common 
interlacing, and so they all have a common interlacing. 

Conversely, if they have a common interlacing then all positive linear com- 
binations are in P, and so 

(a 1 ,...,a^)diag(f 1 ,...,f^)(a 1 ,...,a,J t = a 2 fj + • • • a 2 ^ G P. 

□ 

Remark 18.4. We can use this result to show that the tensor (Kronecker) prod- 
uct does not preserve hyperbolic matrix polynomials. If f = ( J x ° a ) then 
f ® f <g> f is an 8 by 8 diagonal matrix whose diagonal contains x 3 and (x — l) 3 . 
Since these two do not have a common interlacing, f ® f <g> f is not in Hypj. 

Replacing x by Ax in a polynomial with all real roots does not usually yield 
a hyperbolic matrix polynomial. 

Corollary 18.5. Suppose that A is a positive definite matrix with eigenvalues 
di, . . ., d^, and f G P. Then f (xA) G Hyp 1 if and only if {f(dix),. . .,f(d^x)} 
have a common interlacing. 
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Proof. Let O be orthogonal, and D diagonal so that A = 0*00. Then 

f(xA) =0*f(xD)0 = O t diag(f(xd i ))0 

Thus, f(xA] G Hyp! if and only if diag(f(xdi)) G Hypj. Lemma [183] com- 
pletes the proof. □ 

Remark 18.6. There is a simple criterion for {f(dix}i to satisfy the corollary. 
Suppose that roots(f) = (r^), and a > 1. The roots of f (ax) are T\/a < ■ ■ ■ < 
r n /a. If they have a common interlacing then 

Ti/a < ti < r 2 /a < r 2 • • • 

and thus a < mini>j Ti/rj. If < a < (3 then f (ax) and f(|3x) have a common 
interlacing if and only if f (x) and f (|3x/a) do. We conclude that {f (dix)} has a 
common interlacing if and only if 

di . . Ti 
max — ^ mm — 

i,j dj i>) Tj 

Note that if this condition is satisfied then they are actually mutually interlac- 
ing. 

If we add off-diagonal entries that are all equal, then we can determine 
when the matrix polynomial is in Hyp^ Define 

aid 2 + a 2 a 3 + a 3 a 4 H h a n _ia n 

oVi = max 



a\ + ■ ■ ■ + al 



Some values of On are 



i Vi l + VE V3 

^2 = 2 CT 3 = ~Y ff 4 = 1 0"5 = — 

Lemma 18.7. The nbyn tridiagonal matrix below is in Hyp l if and only if f+2ag G 
P for |a| < cr n . 

/f g ... \ 
g f g ... 
™-M = g f g . . 

\o o ••• ••. •./ 

Proof. If v = (ai, . . . , a n )', then 

v t rav = (af H h c£)f + 2(aia 2 + a 2 Q3 + a 3 a 4 H h a n _ian.)g 

Dividing by the first factor, we see that this is in P if and only if f + 2ag G P 
for |a| < cr n . □ 

The off-diagonal element can even be a constant if it is small enough. 
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Corollary 18.8. Suppose that f G P. The following are equivalent: 

2- (£?)e HyPi 
2. |c| < min |ffa)| 

f'(a)=0 

Proof. Geometrically, the second condition means that the largest value of |c| 
satisfies f + b e P for all |b| < |c|. Since 2 02 = 1/ the result now follows from 
the lemma. □ 

The off diagonal entries are not necessarily in Pd- This can be deduced 
from Lemma [l8.7l by taking g small relative to f . For example, 

( x 2 -l e(x 2 + l) 
^e(x 2 + 1) x 2 - 1 

The reverse preserves Hypj. 

Lemma 18.9. Iff[x) — 2^a=o^ x1 e ^VPv an & zs positive definite, then the 
reverse f rev (x) = 2~Z£=o Ai.x. n_i is also in Hyp 1 . 

Proof. Since Ao is positive definite, the leading coefficient of f rev is positive 
definite. The conclusion follows from the calculation 

< vf rev ,v >=< vx n f (l/x),v >= x n < vf (l/x),v >=< vf,v > Tev . 

□ 

The product of hyperbolic matrix polynomials is generally not hyperbolic 
since the product of the leading coefficients, which are positive definite matri- 
ces, is not usually positive definite. We do have 

Lemma 18.10. iff, g G Hyp x then f 1 g f G Hyp v 

Proof. If A is the leading coefficient of f and B is the leading coefficient of 
g, then the leading coefficient of f 1 g f is A*bA which is positive definite. If 
v G and w = f v then 

v t f t gfmv=w t gwe P 

□ 

18.2 Interlacing 

There are two equivalent ways to define interlacing in Hyp d : the usual linear- 
ity definition, or reduction to one variable. We start with the latter, since it is 
more precise. 



J G Hyp! |d sS 1 



CHAPTER 18. MATRIX POLYNOMIALS 



544 



Definition 18.11. Suppose that f, g e Hyp d have degree n and n — 1 respec- 
tively. We define 

f < g if and only if v*fv < v* gv for all v e M H \ 

f<g if and only if v t fv<v t gv for all v e IR H \ 

<C and <|C are defined similarly 

Lemma 18.12. Suppose that f , g e Hyp d have degree n and n — 1 respectively. The 
following are equivalent: 

1. f + ag e Hyp d for all ael 

2. f <g 

Proof. By hypothesis we know that 

v t (f + ccg)v = v*fv + av*gv e Pd for all a. e M 
and consequently we have that v*fv < v t gv. The converse is similar. □ 

We now have some easy consequences of the definition: 
Lemma 18.13. Assume that f, g, h e HyPd- 

2- Iff < 9 an d f < H then f < g + h. In particular, g + he HyPa- 

2. Ifi < g < hthen f — h<cg. 

3. I/k e Pd f^ew fk e Hyp d . 

4. If flZ/ en are positive and f e /~/yp d fnen 

(aixi H a d x d + b)f <f 

5. If f/ze i-f/z diagonal element off is fi, and of g is gt then f 4£ g implies ft <c q-i.. 

6. If F < G in Pd awd Mi, M 2 are positive definite matrices then FMi<G M 2 in 

"/Pd- 

7. If gi, . . . , g r /iflZ7e a common interlacing in Pa, and Mi, . . . , M T are positive 
definite matrices then gi Mi + • • • + g T M r e Hyp d . 

Proof. By hypothesis we know thatv t fv<v t gvandv t fv<v t hv. Since f, g, h € 
Hyp d we know that the leading coefficients of these three terms are positive, 
so we can add inter lacings in Pd: 

v t fv<v t gv + v t hv = v*(g + h)v 
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The proof of the second one is similar. The third one follows since P a is closed 
under multiplication, and k is a scalar: 

< fk • v,v >= k < f • v,v >e P d 

The fourth follows from the third one. For the next one, note that if z\ is 
the vector with 1 in the i-th place and zeros elsewhere, then ejTet is the i-th 
diagonal element of f . 

If F < G then v* Mi v and v* M 2 v are positive so v t FMiv < v l G M 2 v. If G < gt 
for 1 ^ i ^ r then G l<gi M.\ by the previous result, and the conclusion 
follows by adding inter lacings. □ 

The interlacing of even linear polynomials leads to some complexity. 

Lemma 18.14. Suppose that A, C are positive definite, and B, D are symmetric. 

1. xl — B < xl -DiffB^D (i.e. B - D is positive definite). 

2. xA-B<xC-DzJ 

v t Bv v*Dv 

— - > — - for all v (18.2.1) 

Proof. If xl — B <C xl — D then for all non-zero v we have 

V t (xl-B)v<v t (xl-D)v 
(v*v)x - v t Bv < (v*v)x - v*Dv 

These are linear polynomials with positive leading coefficients, and they in- 
terlace iff v*Bv ^ v*Dv. Thus v t (B - D)v ^ 0, so B ^ D. 
In the second case 

v t Avx — v*Bv<C v*Cvx — v*Dv 

for all non-zero v. Solving each equation for x and using the first part yields 
the conclusion. 

□ 

The condition 1118.2. Il l is not well understood. Let A mi . n (W) and A mQX (W) 
be the smallest and largest eigenvalues of a matrix W. Suppose that matrices 
A, B, C, D are as in the lemma and (|18.2.11 is satisfied for all non-zero v. If W 
is any symmetric matrix then the Rayleigh-Ritz inequality [85J says that 

vtWv 

A min ( W) < — - < A mQX ( W) for all v ^ 

We may assume that v has norm 1. The values of v l Wv lie in the interval 
(A m in(W), A max (W)). If (1523) holds it follows that 
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(v*Bv) (ytDv) ^ (v*Av) (v l Cv) for all v 

(D) (A)A min (C) (18.2.2) 

The values of lie in the interval [A min (BA" 1 ) / AmaJBA" 1 ] and there- 
fore 

AmintBA- 1 ) ^ A^fDC- 1 ) and (18.2.3) 

AnxinfBA-MA^nfCD- 1 ) ^ X 

Conversely, if we know that the stronger condition 

AmaxtBA- 1 ) < A^ntCD- 1 ) 

holds then we can conclude that J18.2.1|l holds. 

Hyp d behaves well under induced linear transformations. 

Lemma 18.15. Suppose that T: P d — > P d , and if d is 1 then T preserves the sign 
of the leading coefficient. Define a map on Hyp d by T{Y_ Ajx 7 ) = Y_ AjT(x 7 ). Then 
T: Hyp d — ► Hyp d . 

Proof. If f 6 Hyp d then the leading coefficient of T(f ) is positive definite. The 
result now follows from 

T < f • v,v >=< T(f) • v,v > 

□ 

The derivative behaves as expected. The proofs follow from the previous 
lemma, and the arguments for P. 

Lemma 18.16. Suppose that f, g e Hyp d and h e pP° s . 

& e Hyp d . 

4. h(9/3xi)f € Hyp d 
Example 18.17. What does the graph of a diagonal matrix polynomial look 
like? Consider f = ^ '" _1 ' ' x_3 ' ( x -2)°(x-4) ) • We have the interfacings 

(x-l)(x-2)I<f<(x-3)(x-4)I 

If we graph (l,v) t f(l,v) then Figure [18.11 shows that it stays in the regions 
determined by (x — 1) (x — 2) =0 and (x — 3) (x — 4) = 0. The small dashed 
lines are the graph of (x — l)(x — 3) =0, and the large dashed lines are the 
roots of (x — 2) (x — 4) = 0. This is a general phenomenon - see page 15561 
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Figure 18.1: The graph of a diagonal matrix polynomial 
18.3 Common interlacings 

If f € Hypj and g e P, then f < gl if and only if v*fv< g for all v e W 1 . If 
either condition holds we say that f has a common interlacing. 

The usual definition! 122J of hyperbolic polynomials allows the vectors v 
to be complex. This is equivalent to the existence of a common interlacing. 
Recall that w* is the conjugate transpose. 

Lemma 18.18. Suppose that f is a matrix polynomial whose leading coefficient is the 
identity. The following are equivalent. 

1 w*fw e P for all w e O. 

2a {v*fv | v e M^} has a common interlacing. 

2b f has a common interlacing. 

Proof. 2a and 2b are equivalent by definition, so assume that 1 holds. Write 
w = u + iv where u, v are real vectors. Then 

w*fw = u*fu + u*f (iv) + (iv)*fu+ (xv)fiv 
= uVu + v*fv 

Replacing v by av shows that u*fu + a 2 v*fv £ P for all a, so u*fu and v f fv 
have a common interlacing. Conversely if all u'fu and v*fv have a common 
interlacing then w'fw e P. □ 

Do all hyperbolic polynomials in Hyp 1 have a common interlacing? This 
is an unsolved question, but it is true for two by two matrices. 

Lemma 18.19. Any two by two matrix F e HyP\ has a common interlacing. 



CHAPTER 18. MATRIX POLYNOMIALS 



548 



Proof. If we write F = ( £ £ ) and let v = ( (3, 1 ) then 

vtfv = (3 2 f + 2(3 h + g 

Lemma [3.501 guarantees the existence of a common interlacing. □ 

There is a simple condition that implies a common interlacing. 

Lemma 18.20. Assume f (x) = x 2 I + Ax + B where A, B are symmetric. If there is 
an oc such that f (a) is negative definite then f has a common interlacing. 

Proof. Since v t f(a)v < and v t f(8)v > for (3 sufficiently large, it follows 
thatf <(%- a) I. □ 

We have seen that the polynomials of a diagonal hyperbolic matrix poly- 
nomial have a common interlacing. This is true in general. 

Corollary 18.21. The diagonal elements of a polynomial in Hyp 1 have a common 
interlacing. 

Proof. Any two by two principle submatrix is in Hypj, and by Lemma [18. 191 
the diagonal elements have a common interlacing. Since any two diagonal 
elements have a common interlacing, they all have a common interlacing. □ 

Corollary 18.22. If f € Hyp 1 has a common interlacing then the determinant and 
trace ofi are in P. 

Proof. If det(f(A)) = then there is a w G C so that f(A)w = 0, and thus 
w*f (A)w = 0. By Lemma [l8.18l we know that w*fw S P for all complex w, so 
A is real. 

Since the diagonal elements have a common interlacing, their sum is in 
P. □ 

Lemma 18.23. Iff<gin Hyp 1 then trace(f) <Jtrace(g) in P. 

Proof. For any a G R we know that f + ag G Hyp x . Now the trace is a linear 
map Hypj — ► P, so trace(f ) + atrace(g) G P, which finishes the proof. □ 

Example 18.24. Here is a simple example of a polynomial with a common 
interlacing. If v = (a, b) and 

f = x (J ?) + (i q 

then 

v*fv= (a 2 + b 2 )x + 2ab 
Consequently, the roots of v*fv lie in the closed interval [—1, 1] and so 

(x-l)(x + l)I<f. 
The determinant of f is x 2 — 1, which is in P, as the corollary tells us. 
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Empirical investigations suggest that the adjugate of a matrix that deter- 
mines a polynomial in P^ os is in Hyp d . We prove this for d = 1, and show that 
there is a common interlacing for all d. 

Assume that Di, . . . , D d are positive definite (xby (_l matrices, and consider 
the matrix 

M = Id + xiDj H hx d D d . 

Recall that M[i, j] is the matrix formed by removing the i'th row and j'th col- 
umn of M. The adjugate of M is 

Adj [M] = (|(-l) i+j M[i,j]|)- 

The adjugate satisfies 

Adj[M] • M = |M|Id 

and our conjecture is that Adj(M) e Hyp d . First of all, we show that Adj(M) 
has a common interlacing. 

Lemma 18.25. IfM is as above and v € then |M| < v 1 Adj(M) v. In particular, 
v t Adj(M)ve P p d s - 

Proof. Choose v = (vi) e l^ 1 and define u = ((— l) x Vi). Introducing a new 
variable y, define 

N = M + y ■ uu*. 

Now N e P d +! since uu* is positive semi-definite. Also, uu* has rank one, so 
N is linear in y. Expanding |N yields 

|N| = |M|+u^(-l) i+j ViV,|M[i,j]| 

= |M|+yv*Adj(M)v 

We conclude that |M| < v* Adj(M) v since consecutive coefficients interlace. 

□ 

In order to show that Adj (M) e Hyp d we need to show that the coefficients 
of the homogeneous part are positive definite. It's easy for d = 1. 

Lemma 18.26. IfD is positive definite then Adj(Id + xD) e Hyp v 

Proof. Let M = Id + xD. The degree of |M[i, j]| is at most \i — 1, so the degree 
of Adj (M) is atmost (u.-l)|X. The coefficient of x^^ 1 ' is |D[i,j]|) = 

|D|D _1 . Since D _1 is positive definite, we see that Adj(M) e Hypj. □ 

We can sometimes substitute matrices for constants, and we get hyperbolic 
matrices with a common interlacing. Recall that the norm ||M|| is the maxi- 
mum eigenvalue of a positive definite matrix M and satisfies 

v 1 Mv 
= sup — - — . 
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Lemma 18.27. Suppose that M is positive definite and deg(f ) > deg(g). 

1. Iff + ocge Pfor < a s$ ||M|| then fl + gM e Hyp a . 

2. fI<fI + gM 

Proof. The leading coefficient of f I + gM is I. To check the inner product con- 
dition, compute 

v 1 (f I + gM) v = v*v + ^^g) 
Since ^ ||M|| the inner product condition holds. Next, 

f I + et(fl + gM) = (1 + a) ( f I + -^-gM 

V 1 + a 

and the latter polynomial is in Hypj since || y^M|| < ||M|| for all positive 
oc. □ 

Lemma 18.28. I/f € Hyp & has a common interlacing then |f | e Pa- 
Proof. If we substitute for all but one of the variables then we get a polynomial 
in Hypj with a common interlacing. We know that its determinant is in P, so 
substitution is satisfied. 

Assume that f has degree n. If we write f = Y.i a i x ' then the homogeneous 
part of |f | is I }Z|i|=n a i x 'l- ^ we replace each monomial x 1 by a new variable 
yi then we know that | X,|i|= n a iyi I nas ai l positive coefficients since all the ai 
are positive definite. Replacing the yi by y 1 shows that |f| H has all positive 
coefficients. 

□ 



Lemma 18.29. I/f < g in Hyp 1 then f + yg e Hyp 2 . 
Proof. It suffices to show that 

F e (x,y) =f(x+ ey) + (y + ex)g(x + ey) e Hyp 2 
for positive e. Now the homogeneous part is 

= f H (x + ey) + (y + ex)g H (x+ ey) 

and f H , g H are matrix polynomials with positive definite coefficients it follows 
that F^ 1 is a sum of polynomials with positive definite coefficients, and so all 
its coefficients are positive definite. 

Next, we verify substitution. Fix v e M. n and let G(x,y) = v t fv + yv t gv. 
By hypothesis v t fv<v t gv, so we know that G(x + ey,y + ex) e Pz- Thus, 
F e e Hyp 2 . □ 

Lemma 18.30. I/gl < f in Hyp 1 then 
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1. |gI+yf|eP 2 

2. If we write |I + yf\ — Y. ^d x )v x then e P. 

3. fi<gf i+ i. 

Proof. The first part follows from the previous lemma. If we expand the de- 
terminant we get 

\9l + Vf\ = Y_Q lL - i hWv i 

Since this is in P 2 we see that g^ _1 fi < g^ _(l_1 'fi+i which implies that 
fi<gfi+i. □ 

The next corollary generalizes the fact the the determinant and trace are in 

P. 

Corollary 18.31. If f e Hyp 1 has a common interlacing then V" |f[S]| e P. 

S|=k 

Proof. We have Sc{l,...,n) and f [S] is the submatrix of f only using the rows 
and columns in S. The sum in question is the coefficient of y n ~ k in |ygl + f \, 
and this is in P by the last lemma. □ 

Remark 18.32. If we drop the assumption that matrices are positive definite 
then we can find a 2 x 2 matrix M that satisfies v'Mv e P for all v, yet has no 
common interlacing. Choose 

F(x,y,z) =f + y g + zh + yzke P 3 

Choose a, |3 e M and a^O. Since 

(« p )(h °)(p) =a2[f+ ^ 9+h,+ 2k ] 

we see it is in P since it equals F(x, (3/a, (3 /a). For a particular example, con- 
sider 

(x + y + z + 1 ) (x + 2y + 3z + 5) 

= 5 + 6x + x 2 + y (7 + 3x) + z(8 + 4x) + 5yz + 2y 2 + 3z 2 

Taking coefficients yields the matrix 

[5 + 6x + x 2 8 + 4x\ 
M ={ 7 + 3x 5 ) 

The table below shows that there is no common interlacing. 

~b roots of (lbJMtlb) 1 

-5 -35.1 -5.8 
5 2.0 26.9 
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18.4 Orthogonal-type recurrences 

The fact that f<g<.h =>■ f— h<g is all we need to establish orthogonal 
type recurrences in Hyp d . 

Lemma 18.33. Define a sequence of matrix polynomials by 



f = U 

fi = Aixi + h A d x d + S 

fk = (aic,i*i H h a k , d x d + b k )f k _i - c k f k _ 2 



ro/iere U and the A\ are positive definite, S is symmetric, all c k are positive, and all 
ay are positive. 

Then all f k are in Hyp d and f k < f k _j . 

Proof. We note that f and fj are in Hyp d . By induction we have that 



The off-diagonal entries of a matrix polynomial in Hypj might not have 
all real roots. Here's an explicit example. The coefficient of x in fj below is 
positive definite, as is fo, so f 2 G Hypj. However, the off diagonal entry of 
f 2 has two complex roots. The diagonal elements have roots (—2.4, .4) and 
(—2.5, 1), so they have a common interlacing. 

In general, all the diagonal elements of the sequence f , f i, . . . form an or- 
thogonal polynomial sequence. The off diagonal elements obey the three term 
recursion, but the degree term might not be positive, and so they might not 
have all real roots. 



(a k/i xi H a k/d x d + b k )f k _i < f k _i < f k _ 2 



and the observation above finishes the proof. 



□ 




f 2 = (x + l)fi-f 
/ x 2 + 2x-l 




Example 18.34. Next, an example in two variables. The coefficients of x and 
y in fi below are positive definite, but not all entries of f 2 are in P d since the 
homogeneous part has positive and negative coefficients. 
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f 2 = (x + 2y + l)fi-f 

/ x 2 + 5ux + 2x + 6y 2 + 5y — 1 — x 2 — yx — x + 2y 2 + y — 3 \ 
" ^ -x 2 -ux-x + 2y 2 +u -3 2x 2 + 6yx + 3x + 4y 2 + 4y - 5 J 

18.5 Problems with products 

It's reasonable to think that a product such as 

(xI-Si)(xI-S 2 )---(xI-S n ) 

should be in Hyp l7 but this fails, even in the diagonal case. If we take each Si 
to be a diagonal matrix, then the product is a diagonal matrix, and we know 
that a necessary condition is that there is a common interlacing. To try to 
satisfy this condition, we can require that all eigenvalues of Sj are less than 
all eigenvalues of S 2 , which are less than all eigenvalues of S3, and so on. 
However, this does not always produce hyperbolic matrix polynomials. If we 
perturb a diagonal matrix that gives a hyperbolic polynomial by conjugating 
some of the terms, then we can lose hyperbolicity. 

If e xx — ( _^°?^ T ™^ ) then the graph of the inner product of the polynomial 

f(x) = (xI-e^(io)e 2 ^)(xI-e-(30)^) 

is given in Figure [T8.21 and is clearly not in Hyp^ 




Figure 18.2: The graph of a matrix product not in Hypj 



Here is an example of a product that is in Hypj. 

'_( 1-24 -0.42\\ / / 3.97 -0.15\\ ( ,_( 6.99 -0.04^ 
, 1-0.42 1.75 ) M 1-0.15 3.02 J M ^ 1-0.04 6.00, 



CHAPTER 18. MATRIX POLYNOMIALS 



554 



where the eigenvalues of the matrices are (1,2), (3,4), (6,7). The graph of 
(l,y) t f (1,tj) is in Figure [T8. 31 The three dashed lines are the asymptotes, and 
the dots are the roots of the determinant, drawn on the x-axis. The graph 
shows that f 6 Hyp^ since each horizontal line meets the graph in exactly 
three points. 




Figure 18.3: The graph of a hyperbolic matrix polynomial 



Some constructions that you would expect to give hyperbolic polynomials 
give stable matrix polynomials. For instance, if A is positive definite then 
(xl + A) 2 is generally not hyperbolic, but is a stable matrix polynomial. (See 

P.E52) 

18.6 Sign Interlacing 

Recall that f sign interlaces g if the sign alternates as we evaluate f at the roots 
of g. There is a similar definition for f € Hypj, where the role of alternating 
signs is replaced by alternating positive and negative definite matrices. 

Lemma 18.35. Suppose that g e P has roots r% < ■ ■ ■ < r n . 
Ifie Hyp 1 satisfies gl < f then 

(— l) n+1 f (rO is positive definite for 1 ^ i ^ n 

Proof. The hypotheses imply that v t (gl)v = (v t v)g <v*fv, so v*fv sign inter- 
laces g. Thus, (— l) n+1 v t f (ri)v > for all non-zero v, and hence (— l) n+1 f (ti) 
is positive definite. □ 

The converse can be used to show that a matrix polynomial is in Hypj. 

Lemma 18.36. Suppose that g e P has roots r\ < ■ ■ ■ < r n+ i. If 

1. (— l) n+1 f (ti) is positive definite for 1 ^ i < n + 1 

2. f has degree n. 



CHAPTER 18. MATRIX POLYNOMIALS 



555 



3. The leading coefficient off is positive definite, 
then f G Hyp 1 and gl < f. 

Proof. From the first condition we get that v*fv sign interlaces g. Since the 
degree of g is one more than the degree of v'fv, we conclude that g <v t fv. 
This implies that v*fv G P, and that gl < f. □ 

Construction 18.37. We can use sign interlacing to construct elements of 
Hypj. We start with the data 

real numbers pi < P2 < ■ ■ • < Pn+i 
positive definite matrices mi, • • • , m n+ i 

We claim that the following is in Hypj: 

n+l 

f(x) = y_ ^ n^—pi) ( i8 - 6 - 1 ) 

k=l i^k 

In order to verify that f G Hypj we see that 

f(Pk) = n(Pk-Pi) m k 

Mk 

and the sign of the coefficient of m k is (— l) n + k+1 since the Pi are increasing. 
Since the leading coefficient of f (x) is ^ k m k which is positive definite we can 
now apply the lemma. 

Note that if |i = 1 this is exactly Lemma [1.201 

Construction 18.38. There is a similar construction for polynomials in Hyp d . 
The idea is that we construct a polynomial by adding together lots of interlac- 
ing polynomials, multiplied by positive definite matrices. So, we start with 

positive definite d by d matrices Di, . . . , Da 
positive definite d by d matrices Mi, . . . , 

Let 

A = I + xiDi + --- + x d D d 
f (x) = |A| 

ft(x) = determinant of i'th principle submatrix of A 
g(x) = fi(x)Mi + --- + f d (x)M d 

Since f < ft it follows that fl <f|Mi. Addition yields our conclusion: f I < g in 
HyPd- 

The Hadamard product of two matrix polynomials f = Y. AiX 1 and g = 
Y_ Bi.x x is f * g = Y. A-i * ^i- xl where At * Bt is the usual Hadamard product 
of matrices. In certain circumstances Hypj is closed under the Hadamard 
product * with a positive definite matrix. 
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Lemma 18.39. Suppose that f < gl where f e HyP\ and g^P.IfPisa positive 
definite matrix then f * P < gl. In particular, f*Pe HyP\- 

Proof. If the roots of g are T\ <■■■ < r n then we know that (— l) n+1 f(rjj is pos- 
itive definite. Now the Hadamard product of two positive definite matrices is 
again positive definite [85 J, so 

(-ir +i (f*P)(rO = (-D n+i f(n)*P 

is positive definite, as is the leading coefficient of f * P. The conclusion follows. 

□ 

The Hadamard product of three hyperbolic matrix polynomials can be in 
HyPi- 

Lemma 18.40. If f, g, h e Hyp 1 have the same common interlacing then 
f *g*he Hyp v 

Proof. Let \\, . . . , r n+ i be the roots of the common interlacing. We know that 

(-l) n+t f(r t ), (-l) n+i g(n), (-l) n+i h(n) 

are all positive definite, and so (— l) n+t (f * g * h)(rt) is positive definite. We 
now apply Lemma [18.361 □ 

18.7 Root zones 

Hyperbolic matrix polynomials in one variable are generalizations of poly- 
nomials with all real roots. There are three different notions of the zeros of 
a hyperbolic matrix polynomial f (x), and all reduce to the same idea if the 
matrix dimension is 1 : 

1. The roots of the diagonal elements. 

2. The roots of the determinant of f . 

3. The roots of all the quadratic forms v*fv. 

What does the graph of a hyperbolic matrix polynomial f S Hypj look 
like? When \\. = 2 we can graph (l,u)f (l,y )* - Figure [T8.3| is a typical picture. 

The first point to note is that the graph does not contain large x values. 
This can be seen as follows. It suffices to consider v satisfying |v| = 1, and 
that the leading coefficient of f is the identity. In this case v l fv is monic and 
the coefficients of v*fv are all bounded (by compactness), and so the roots are 
bounded. 

If a polynomial f e Hyp x has a common interlacing g, then if we graph f 
and the vertical lines corresponding to the roots of g, then the strip between 
any two consecutive such vertical lines contains exactly one solution curve of 
f. 
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The usual terminology is as follows. If f E Hyp x define to be the set of 
all x such that there is a v where x is the k'th largest root of v f fv. is called 
the k'th root zone. 

Interlacing of root zones implies interlacing. 

Lemma 18.41. If the root zones of two hyperbolic matrix polynomials alternate then 
the polynomials interlace. 

Proof. Let r\, . . . , r n be the root zones of r, and s\, . . . , s n the root zones of s. 
The hypotheses mean that the zones are arranged as in the diagram 




r l Si T 2 S2 S n 



The ft's and the gi's are values that separate the various root zones. Let f = 
Y\( x — f-i) an d g = nt x — 9i)- The interlacings 

r<fl<s r,s<gl 

easily imply that r ^ s. □ 

Example 18.42. The converse of this lemma is not true - it fails even for diag- 
onal hyperbolic matrix polynomials. Let 

/(x + 32)(x + 36) \ ,/2(x + 34) \ 

^ (x + 14)(x + 35y \ 2(x + 24.5)J 

We know that f < f yet their root zones overlap (Figure H 8. 4b - 



-34 
E— 



-24.5 
—3 



-36 



-3 

-35 



-32 



-14 



Figure 18.4: Overlapping root zones 



Lemma 18.43. The diagonal elements of a hyperbolic polynomial have a common 
interlacing. 

Proof. If f e Hypj then each two by two principle submatrix is in Hypj, and 
the two diagonal elements have a common interlacing. Consequently every 
two diagonal polynomials have a common interlacing, so they all have one. 

□ 
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If the degree of a matrix polynomial is greater than two then we do not 
have nice pictures such as Figure H8.3I We can make another graph by draw- 
ing all of the graphs v t fv for all non-zero v. The result looks like Figure [T8. 51 
The points on the x axis show that all the curves v*fv have a common inter- 
lacing. If A m i. n (M) and A mQX (M) are the smallest and largest eigenvalues of a 
symmetric matrix M then we have the inequality 

v*fv 

A mi n(f(x)) < — < A 

max fx 

The dashed boundary curves in the figure are the graphs of the maximum and 
minimum eigenvalues of f (x) as a function of x. 



\ 

\ 




Figure 18.5: The graphs of v t fv as v varies 



The Hadamard product preserves Hypj if one of the terms is a diagonal 
polynomial. 

Lemma 18.44. If f, g £ Hyp 1 and f is diagonal then f * g e Hyp 1 . 

Proof. Since the diagonal elements of f and of g have common interfacings 
F and G, the result follows from the fact that fu<F and ga<G implies 
(Lemma 17.9b 

f« * gu < F * G 

□ 

The roots of the diagonal fall in the root zones. Indeed, if the diagonal 
elements of f are ft, and e\ is a coordinate vector, then e\ iz\ = fi. 

The location of the roots of the determinant of a hyperbolic matrix polyno- 
mial fit nicely into the root zone picture. 

Lemma 18.45. Suppose that f e Hyp 1 (n) has a common interlacing. Then, det(f) 
has n\i roots, all real. In the interior of each root zone all f (x) are positive definite or 
negative definite. There are p. zeros of the determinant in each root zone. 

Proof. This follows from Lemma 118.351 since we know how many positive 
eigenvalues (\x or 0) there are in each region between the root zones. As a 
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procedes from one root of the common interlacing to the next, there is a change 
from all eigenvalues of one sign to all eigenvalues of the opposite sign. Each 
time an eigenvalue changes sign, the determinant must be zero. □ 

Example 18.46. Suppose that |j. = n = 3, and that the roots of det(f) are 
T\, . . . , r<). Then 

interval number of positive roots 

(-00,-rO 

(ri,r 2 ) 1 

(t 2 ,t 3 ) 2 

(r 3 ,r 4 ) 3 

(r 4 ,r 5 ) 2 

[T5,T 6 ] 1 

(r 6 ,r 7 ) 

[t 7 ,t 8 ) 1 

(r 8 ,r 9 ) 2 

(r 9 ,oo) 3 

We can show that a polynomial is in Hypj by checking if all roots of the 
determinant are real, and that the number of positive eigenvalues in each in- 
terval follows the patterns described above. 

18.8 Quadratics and Newton inequalities 

Quadratic matrix polynomials have been widely studied [111]. We derive a 
simple inequality for eigenvalues of the quadratic, and use it in the usual way 
to get Newton inequalities for hyperbolic matrix polynomials. 

There are necessary and sufficient conditions for a quadratic matrix poly- 
nomials to belong to Hypj, but they are not as elementary as for polynomials. 

Lemma 18.47. Suppose f (x) = Ax 2 + Bx + C where A and C are non-negative 
definite. 

1. Iff € Hy Pl then A max (BA _1 ) A^JBC- 1 ) ^ 4. 

2. I/A^tBA-^A^BC- 1 ) >4thenf e Hy Pl . 
Proof. If f e Hyp! then 

v'Avx^v^vx + v^v e P 

Thus the discriminant must be non-negative, and so 

(v'Bv) 2 ^ 4(v t Av)(v t Cv) 

Now v t Av and v*Cv are non-negative, so this is equivalent to 

v t Bv .yfiCv 

— — ^ 4 

v t Av v t Bv 

We now apply ( I18.2.3D . □ 
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Corollary 18.48. Suppose that f = £ Atx 1 e Hyp[ os (n). For k=l,. . . ,n-l the 
following inequality holds: 



A max (A k A k ; i )A mQX (A k A-i 1 ) ^ (1 + -)(! + 



k" n-k J 

Proof. Since Hyp] 3 os is closed under differentiation and reversal, we can fol- 
low the usual proof of Newton's inequalities. □ 

The coefficients of the associated quadratic form are not arbitrary. Suppose 
that f (x) — Y_ Aix 1 is a matrix polynomial in Hypj os (n). We know that for all 
v ^ and < i < n 

v*Av 

A mtn ^ ^_ ^ .max 

Define the two vectors 

A f"\min a min -i 

fimax imaxi 

max — l A o ' • ■ • ' A n I 

The inequality above shows that for |v| = 1 all the polynomials v*fv are poly- 
nomials of the form poly(c) where ^ A m i. n < c ^ A max . (See p.|39l). 



18.9 Newton inequalities in Hyp 2 

The coefficients of a quadratic in Hyp 2 satisfy inequalities, and they lead to in- 
equalities for general polynomials in Hyp 2 . We begin with an example. Con- 
sider the polynomial f (x, y) (p. 1552b : 



( -3 -5 ) ( -1 3 1 ) X ( -1 2 ) %2 

If we compute v x fv then we get a polynomial in P 2 . For instance, if we take 
v = (2,3) we get 

84 y 2 



68 y 62 xy 



-85 23x 



10 x 2 



CHAPTER 18. MATRIX POLYNOMIALS 



561 



This polynomial satisfies the rhombus inequalities, so we can use this in- 
formation to get inequalities for the original polynomial. In general, we start 
with f (x,y) = Y_ AyxHj' in Hypj os . If v is any vector then 

v*fv = ^(^Ayv) *V € P2 

The rhombus inequalities (Proposition I10.4lb imply that the following holds 
for all vectors v 

(v* A u+1 v) (v* A i+1J v) > (v* Ay v] (v* A i+1 , j+1 v) 
Using dl8.2.3b this implies rhombus inequalities in Hyp 2 : 

^maxfAij + iA^j 1 ) ^minCAi+lj+lA^j j ) 

18.10 Singular polynomials 

We now look at some examples of matrix polynomials f for which the leading 
coefficient of a matrix polynomial is positive semi-definite and singular, and 
v*fv e P for all v G M^. Unlike hyperbolic matrix polynomials, the root zones 
can be infinite, or all of them can even be equal. 

First, we consider the matrix F = ([J) where deg(f) > deg(h). If f is 
monic then the leading coefficient of F is (Jjj), which has eigenvalues and 1, 
and is positive semi-definite. The quadratic form is <x 2 f + 2<x|3h. which is in P 
for all a, |3 if and only if f <J h. Figure [T8. 61 is the graph of such a polynomial. 




Figure 18.6: Positive semi-definite leading coefficient 

Assume that f <h. If e > and F e = (£ ^ ) then F e e Hypj since its 
leading coefficient is ( J ° ) , and the quadratic form is a linear combination of 
f and h. Consequently, F is the limit of polynomials in Hypj. Note that F has 
a common interlacing, but the root zones can be unbounded. 

Next, we have an example with no common interlacing. Let 



_/(x + l)(x + 3) 3x + 5\ 
\ 3x + 5 8 J 
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The leading coefficient is again ( J [] ) and the quadratic form factors: 

(a,(3)F(a,(3) t = (a(x + 1) + 2(3)(a(x + 3) +4(3). 

Consequently, for every x there are two v such that the quadratic form 
v t F(xo)v is zero. Thus, the two root zones are both equal to R, there is no 
common interlacing, and so F is not a limit of polynomials in Hypj. 

18.11 Matrix polynomials with determinants in P d 

In this last section we consider the properties the set of matrix polynomials 
with determinants in P d . Define 

M = {matrix polynomial M | |M| <E Pd for some d} 

If y is a variable not occuring in A or B then we say that A <-P B if and only 
if A + uB e M. 

If the matrices are 1 by 1 then this is just the usual definition of interlacing. 
One interesting question is what properties of Pd extend to M? Clearly M is 
closed under substitution of real values for variables, and of Xi + a for xt. 

Here are some basic interlacing properties: 

Lemma 18.49. I/A^B then 

1. B H 3 —A 

2. A+kBh 3 B/orae R. 

3. AffBl a A for ae R. 

4. f/CeM then AC <-P BC and CA H 3 CB. 

Proof. If |A + yB\ e Pd then reversing just y yields |yA — B| e Pd- Next, 
replacing y by y + a shows that |A + (y + a)B e Pd- From B +-P —Awe get that 
B + oc{— A) H 3 —A, and applying the first part again gives A H 3 B + (— a)A. 
For the next one, note that |AC + yBC| = |C||A + yB|. □ 

Some constructions: 

Lemma 18.50. 

1. If A is symmetric, all Dt and B are positive semi-definite, then 

A + ^xiDi B. 

2. J/Z = diag(zi, . . .,z d ) and A is positive semi-definite then 

a) Z + A e M 

b) 2-AeM 
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c) AZ + I e M 

d) ZA + I e M 

e) IfW = diag(wi) and O is orthogonal then OZ + WO e M, 

Proof. The first one is the statement that |A + yB + Y. x iDi| 6 Pd- For the 
second, reverse each variable separately, and then reverse all at once. The 
next follows from the fact that A + Z = A + ^ztDj. where Dt is the positive 
semi-definite matrix that is all zero except for a 1 in location (i, i) . If we reverse 
all variables we have 

Zi---z d |A + Z -1 | = |AZ + I| = |ZA + I|. 

For the last one, Z + _1 WO is a sum of variables times positive semi-definite 
matrices. □ 

If a < b then x + a <C x + b. The corresponding result for M is 

Lemma 18.51. If A < B are positive definite and Z = diag(zt) then 

A + Z B + Z. 

Proof. Since B — A is positive definite, Lemma [18.501 implies that 

AlZffB-A. 

Applying Lemma [l 8.491 yields the conclusion 

A + Z^B-A + (A + Z)=B + Z. 

□ 

Corollary 18.52. With the same hypotheses, BZ + I <-p AZ + I. 

Proof. Reverse all variables. □ 

As [16J observed, it is an interesting question if A + Z <-P B+Z implies 
A < B. 



Part III 

Polynomials with complex roots 
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Chapter 




Polynomials non-vanishing in a 
region 

In this chapter we introduce polynomials that are non-vanishing (NV) in a 
region D. We show how geometric properties of D imply properties of the 
sets of polynomials such as closure under differentiation, or under reversal. 

Definition 19.1. Suppose that D C C. Define 

NV d (D) = {f | f has complex coefficients and f (a) ^ for all cr e D} 
rNV d (D) = {f | f has real coefficients and f (cr) ^ for all cr e T>) 

Here are some examples of the spaces we will consider. 

Example 19.2. If D is the product of d open upper half planes, then a polyno- 
mial in NVd (D) is called a upper half plane polynomial. It is a real upper half-plane 
polynomial if it is in rNVd(D). The spaces of such polynomials are denoted 
HPd and rHPd. Many results about these polynomials can be found in 1251 , 
H8l and Chapter|20l 

Example 19.3. If D is a product of d open right half planes then a polyno- 
mial in NVd(D) is called a right half-plane polynomial, or a Hurwitz polynomial. 
NVd(D) is denoted 5C d (C) and rNV d (2)) is denoted JC d - In the JCi(C) case, 
a right half-plane polynomial has all roots in the closed left half plane. Such 
polynomials are usually called stable, or Hurwitz-stable. More information 
about these polynomials can be found in |33] and Chapter |2T] 

Example 19.4. If § is a vertical strip such as {z | — 1 < 9t(z) < 1} then we 
consider D = C\§. The set NVi(C\§) consists of polynomials whose zeros all 
lie in the strip §. More information about them can be found in Section fl 9. 101 
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Example 19.5. The open unit disk A in C has two generalizations to C d . The 
first is the open ball A d of radius 1, and the second is the product A x • • • x A. 
The spaces are considered in Section [l9.11l 

Example 19.6. Define the sector S n = {z | — 7t/n < arg(z) < 7t/n}. For instance, 
Si is C \ (—oo,0), and 82 is the open right half plane. We consider properties 
of product of sectors NVatO^ (§ ni )) in various examples in this chapter and 
Section[l9l2 

19.1 Fundamental properties 

In order to derive interesting properties we we usually assume that our do- 
main is an open cone C in C d . This means 6 is closed under addition and 
multiplication by positive constants. We have a few properties that hold for 
general domains. The proofs are immediate from the definition. 

Lemma 19.7. 

1. f(x)g(x) e NV d (T>) if and only t/f (x) € NV d {T>) and g(x) € NV d [D). 

2. Suppose f(x,y) e NV d+e {T) x £) where D c C d and £ c C e . IfaeE then 
f(x,a)eW d (2). 

The following classical theorem of Hurwitz also holds for analytic func- 
tions in d variables. 

Theorem 19.8 (Hurwitz). Let (f n ) be a sequence of functions which are all analytic 
and without zeros in a region CI. Suppose, in addition, that i n (z) tends to f(z), 
uniformly on every compact subset of CI. Then f (z) is either identically zero or never 
equal to zero in CI. 

The next lemma is an immediate consequence of the general Hurwitz's 
theorem: 

Lemma 19.9 (Polynomial closure). IfD is open and {ft} z's a sequence of polyno- 
mials in NV d (T>) that converge uniformly on compact subsets to a polynomial g then 
either g z's zero or is in NV d (T)). 

The next lemma gives several simple ways to construct new polynomials 
in NVd(C) from old ones. The proofs easily follow from the definition. 

Lemma 19.10. Assume that C z's a cone. 

1. Iff(x) G /W d (C) f/zen f (x + y) e NV 2d [G x 6) where x = {x lr ...,x d ) and 
y= (ui,...,u d )- 

2. If a is a positive constant and f(x) S /W d (C) then f(axj, ...,ax d ) e 
NV d (G). 

3. If at are positive constants, Ct are cones, and f(x) e A/Vat]^ CO then 
f(a 1 x 1 ,...,a d x d ) e A/V d (TTei). 
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e 


e- 1 


P 


e/e 


UHP 


-UHP 


-1 


C\ (-oo,0) 


RHP 


RHP 


1 


C\ (-00,0) 


Qi 


Qiv 


1 


RHP 




§n 


1 


§2n 



Table 19.1: Properties of cones 



The homogeneous part f H of a polynomial consists of the terms of maxi- 
mum total degree. 

Corollary 19.11. I/f G NV d {H)G then f H e NV d (G). 

Proof. This follows from the above, the fact that f H is never zero and 

fH(x) = lim !™ 

r— >oo T n 

where n is the degree of f H . 

□ 

The next result refines the earlier substitution result. 

Lemma 19.12 (Substitution). Suppose that T> is open and T> is the closure ofD. If 
f(x,y) G NV d+e (D x E)andot€ 1 5then-f{ot,y) G M/ e (£) orf(a,y) =0. 

Proof. If a G T) then f(a,y) G NV d (£). If a G T> then we approach f(ct,y) by 
sequences in NV<i(2) X £), so we can apply Lemma [l9.9l □ 

The constant in the next lemma only depends on 6 and is called the reversal 
constant . 

Lemma 19.13 (Reversal). Assume 6 C C is a cone and suppose that Y_ xl, fi(y) G 
NV d+ i{G x D). There is a unique p G C with |p| = 1 such that 

I(pxr%)eW d+1 (exl)). 

Proof. If C _1 = {z _1 | z G Ci} then C _1 is also a cone, whose elements are the 
conjugates of C. Since C and C _1 only differ by a rotation there is a unique 
rotation p G C so that p = C. If (oi, . . ., cid+i) G 6 x D then 

r(P0-ir~ l fi(ff2,...,(rd +1 ) = (pffi) ,l I( 1 ) i fi((J2,...,(Td + i) 
* — * — (Jl 

and this is non-zero since p/<Ti G G. See Table ll9.T] □ 

We now consider when a linear polynomial is in NVd('D). The following 
are elementary. 
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Lemma 19.14. The following are equivalent, where 6i.cC are cones. 

1. LfoiXiGA/Vatfieo. 

2. QtCt 

3. }Zf a i *s contained in a half plane. 

Example 19.15. Suppose 6 = §2 x S4 and consider x + cuj . In order for §2 + 0.84 
to remain in a half plane, we must have that a§4 C §2- Thus we conclude that 

x+ay e NV 2 (S 2 x S 4 ) -7t/4 a ^ tt/4 

If r > and 6 c C then x + ry e NV 2 (6 2 ), and hence pxy + r and xy + r/p 
areinNV 2 (6 2 ). 

There's a simple condition if all the planes are half planes. 

Lemma 19.16. If all cones d are the same half plane 6, then the following are equiv- 
alent: 

1. LfatXi e M/ d (6 d ). 

2. All non-zero at /woe f/ze same argument. 

Proof. If ^ ai*,. € NV d (6 d ) then we may assume that <xi is non-zero, so we 
may divide through and assume 0Ci = 1. We now need to show that all non- 
zero at are positive. If some at is not positive, then 6 and aC are different, 
and so their sum equals C. The converse is immediate. □ 

The corollary is simple and the proof is omitted. 

Corollary 19.17. If 6 is a half plane then (3 + xi + a 2 x 2 H h a d x d e A/\/ d (e d ) 

/f and only if 

1. All on are non-negative. 

2. |3 is in the closure of 6. 

Following [18,33] we have 

Lemma 19.18. Suppose 6 c C and f (x) is homogeneous. The following are equiva- 
lent 

1. f(x) e NV d (G d ) 

2. f(a + tb)e NV^e/e) where z,b>0. 

Proof. 6/6 denotes the set {a/|3 a, (3 e 6}. If a/ (3 € 6/6 where a, (3 e 6 and 
f has degree n then by homogeneity 

f(a+ (a/P)b) = (3 n f((3a + ab) 7^0 

since [3a + ab S 6 d . Conversely, let <j\, . . . , cr d e 6. Since 6 is a cone there are 
a, (3 € 6 and a,b > such that (3 a + ab = . . ., cr d ). Thus 

f(ffi,...,CT d ) =f(Pa + ab) = |3 n f(a + b(a/|3)) ^0 

□ 
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19.2 Differentiation 

We can not differentiate with respect to a variable and stay in the same class, 
but we can if the corresponding factor is a half plane. 

Lemma 19.19. J/f(x,y) e NV d+1 (G x CD) and G is a half plane then ^f(x,y) e 
NV d+ i(fixB). 



Proof. Suppose that (ai, . . . , a d+ i) G 6 x CD. Since 

" 3 

I I m i A' ; ii = 

1 3x 



, v x,a 2 ,...,a d+ i) 
3x 



(«i) 



it suffices to assume that d = 1. So, if g G NVj(e) then all roots lie in the 
complement of 6. Now C is a half plane by hypothesis, so its complement is a 
closed half plane, and in particular is convex. By Gauss-Lucas the roots of g ' 
lie in the convex hull of the roots of g, and hence lie in the complement of C. 
Consequently, g' G NV x (e). □ 

Lemma 19.20. IfG is a half plane, f (x,y) = Y. x l fi(y) G M/ d+1 (e x CD) then either 
f i (y ) —0 or 

fi(y) e NV d [D) fori = 0,1,... 

Proof. Differentiate with respect to x until the desired coefficient is the con- 
stant term, and then substitute zero for x. □ 

Example 19.21. The assumption that 6 is a half plane is essential. We give an 
example of a polynomial in NV2(§4) whose coefficients are not all in NVi(§4). 
Let 

f(x) = (x-l) 2 +4 = (x-l-2i)(x-l+2i). 

Clearly f(x) e NV 1 (S 4 ),butf'(x) =2(x-l) ^ NV a (S 4 ). Now consider 

f(x + y) = (x-l) 2 + 4 + 2u(x-l)+u 2 

Weknowf(x + u) e NV 2 (S 4 ), and the coefficient of y° in f(x + y) is inNV a (§ 4 ) 
but the coefficient of y 1 is not. 

Lemma 19.22. lfY_ x l fi(y) € A/U d+1 (e x CD) and 6 is a half plane then 

fi(y) +xf l+a (y) G A/U d+1 (e x CD) for i = 0,1,... 

Proof. Because 6 is a half plane we can differentiate and reverse, so all the 
polynomials below are in NV d +i(e x CD). 

reversing ^~ (px)"^ 1 fj(y) 

n - k- 1 differentiations ^(p) n_i x k+1 ~ i ( n zl i ) n _ k _ 1 f t (y) 

reverse ^(p)™ (px) x ( n - i ) T1 _ > ,_ 1 f t (y) 

k differentiations ^Jp)^ 1 (px) i ~ k ( n zl i ) n _ k _ 1 (i) 1c f i (y) 

which equals p n (f k (y) (n - k) !k! + x f i+1 (y) (n - k - 1 ) ! (k + 1 ) !) 
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Dividing and rescaling yields the result. □ 
Corollary 19.23. J/f(x) g NV d+1 (G x D) and C is a half plane then 

f(x)+y^- g NV d+2 (e x G x V) 

OXj 

P ^f(x) + ^eiw d+2 (exexi)) 

where p is the reversal constant. 

Proof. Since f(xi + y,x 2/ . . .,x d ) g NV d+2 (e xCxB) we can apply the lemma 
to the Taylor series 

91" 

f(xi+y,-.-,x d } =f(x) + — y + --- 

The second statement follows by reversing the first. □ 

We have seen that coefficients of the homogeneous part of a polynomial in 
P d all have the same sign. A similar result holds for NV d (C): if we multiply 
by the appropriate scalar then all the coefficients are positive: 

Lemma 19.24. IfC is a half plane and f g NV d [Q d ) then all non-zero coefficients of 
f H have the same argument. 

Proof. We first assume that f is homogeneous and use induction on d. For 
d = 2 we note that x + ay g NV 2 (C 2 ) if and only if a ^ 0. Thus if f g NV 2 (C 2 ) 
is homogeneous then 

f = a]^[(x + cuy) where all at > 

i 

so all non-zero coefficients have argument arg(oc). 

If f(x,y) g NVd+i(C d+1 ) is homogeneous then we can write 

f(x,y) = 21y i fi(x) 

i 

where all ft g NVd(C d ) and are homogeneous. By induction all coefficients of 
ft have the same argument. 

By Lemma[l9l8]with a = (1, . . . , 1, 0) and b = (0, . . . , 0, 1) we have that 

f(l...,l,t) =£_t i f i (l,...,l) g P pos 

since C/C = C\(— oo,0). Now the argument of ft (1, . . .,1) equals the argument 
of the coefficients of ft, and since f (1, . . . , 1, t) g P pos all coefficients have the 
same argument. 

If f is not homogeneous and has degree n then the conclusion follows from 
CorollaryETE] □ 
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19.3 Interlacing 

Interlacing was initially defined by the positions of roots, and then by linear- 
ity Now we define it algebraically 

Definition 19.25. Suppose f , g G NVdCD). We say that g interlaces f in G, writ- 
ten f ♦«> g, if and only if 

f(x)+yg(x) e NV d+1 (B x G) 

Remark 19.26. Note that if C C C then interlacing in 6 implies interlacing in 
C, but the converse is false. For example, we claim that 

x 2 + y(ax + b) G NV 2 (S 4 x § 4 ) ifa,b>0 

This is clear, since both the image of §4 by x 2 and y ( ax+b) lies in §2. However, 
we also have that f (x,y ) = x 2 + y (x + 1) is not in NV2(§2 x §4). To see this, we 
need to exhibit one zero in § 2 x §4 : 

f( (l + 2i)/8, (19-42i)/680) =0 

However, it's not had to check that x 2 + y (x + 1) G N V2 (§2 X §4), and therefore 
is also in NV 2 (§4 x 84) 

We first have some properties of interlacing that do not require one of the 
cones to be a half plane. 

Lemma 19.27. Assume that f, g, h e NVd[T>), and G is a cone. 

1. f g f h in G iff g «~ h in G. 

2. If-1 g G then f «~ fin G. 

3. If pis the reversing constant for G then f g in G implies pg «~ f in G. 

4. J/0 £ Ci + C 2 and f G A/t^Ci x C 2 ) t/ien xf f m G 2 - 

Proof. The first is trivial. To check if f f we need to see that f (x) + y f (x) = 
f (x) (y + 1) is non-zero. Now f (x) ^ 0, and y + 1 ^ by hypothesis. For the 
next one, f + yg G NV d+ i(e x G ) implies that pg + f G NV d+ i(e x G ). 

Finally, to check xf *^ f we check that xf + yf G NV2(Ci x 62). Simply note 
that (x + y )f 7^ by hypothesis. □ 

Most interesting results about interlacing require a half plane. As always, 
we expect adjacent coefficients as well as the derivative to interlace. The corol- 
lary is a consequence of Lemma [l9.22l and Corollary ll9.23l 

Corollary 19.28. Suppose G is a half plane andf = Y. ^fi(y) G NV d (G x D). 

1. fi «~ fi + i for i = 0,1, 
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Lemma 19.29. IfC is a half plane and f, g, h, k e NV d (D) then (all interlacings in 

e) 

1. f «~ g anrf f implies f w ag + bh/or a, b > 0. 

2. f «~ g and h, k implies fh «~ fk + gh. *~ gk. 

3. g *~ f and h *~ f implies ag + bh *~> f for a, b > 0. 

4. If f «~ g «~ h £/zen f + ph. <^ g. 

Proof. All these results follow using the interlacing of adjacent coefficients. 
For the first we have 

(f + yg)(f + yh) = f 2 + yf(g + H) + y 2 g h 

so f 2 *~> f (g + h) and therefore f «~ g + h. The second is similar. For the third 
we use 

(g + ypf)(g + yph) = g 2 + gyp(f + h) 

so g *~ p(f + h) and reversing again yields f + h <^ g. The last follows from 
the third since ph. <^ g. □ 

Next is the recurrence for orthogonal-type polynomials. 

Lemma 19.30. Assume 6 is a half plane and -1 £ 6. If fo,fi € A/Vd(e x D) ften 

f i <^ fo 

fn — ( a n x i + b n )f n _i + p c n f n _2 where a n , b n ,c n >0. 
f/zen f n ^ f n _i in 6. 

Proof By induction we may assume that f n _i f n -2- Since —1^6 we know 
b n fn-i <^ fn-i- Since 6 + 6 doesn't contain zero we know Xif n _i «~ f n -i, 
and thus (a n xi + b n )f n _i *~ f n -i- By reversal of f n _i «~ f n _ 2 we have 
pfn-2 <^ fn-i/ so addition gives the conclusion. □ 

Example 19.31. We can use differentiation to derive properties of NVi(S 4 ) 
even though NVi (84) isn't closed under differentiation. Since x+y 2 and x+(y+ 

l) 2 are in NV2(§2 x §4) the coefficients of powers of x in [x+y 2 ] 2 [x+ (y + 1) 2 ] 2 
interlace. Thus all the following polynomials are in NVi(§4) and interlace in 

§2. 

y 4 (y + l) 4 *~ 2y 2 (y + l) 2 (2y 2 + 2y + l) <~ 
6y 4 + 12y 3 + 10y 2 + 4y + 1 «~ 2 (2y 2 + 2y + l) *~ 1 

We can use the example above to give a general construction. 
Lemma 19.32. Iff, g e /Wi(S 4 ) and f «~ g in § 2 then 

y 2 f(y) «~ f(y) +y 2 g(y) «~ g(y) in §2 
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Proof. Since x + y 2 and f(y) + xg(y) are in NV 2 (§2 x S4) their product is also. 
Thus 

y 2 f (y ) + x[f (y ) + y 2 g(y )] + x 2 [g(y )] e NV 2 (§ 2 x S 4 ). 

□ 

Remark 19.33. It follows from the lemma that the two matrices 



1 y 2 
1 



y 2 
l y 2 



map pairs of polynomials in NVi(§ 4 ) that interlace in § 2 to another pair of 
polynomials in NVi(S 4 ) that interlace in S 2 

19.4 Some determinant properties 

In this section we establish some properties of determinants we need to con- 
struct polynomials. 

If A and B are anti-diagonal matrices with anti-diagonal entries ( at ) and 
(bi) then 



A I 

-I B 



(aib n + l)"-(a n bi + l) 
The proof is an easy induction. For instance, if n = 4 then 



(19.4.1) 





. aj 1 










a 2 


1 






a 3 






1 




a 4 








1 


1 








bi 


. -1 


-1 . 
. -1 b 4 


b 3 


b 2 





(Qib4+l)(a2b3+l)(a 3 b2+l)(a4bi+l) 



where the dots "." are zeros. A similar argument shows that if I is the n by n 

I ll = 2 n_ 



identity matrix then 



il I 



Lemma 19.34. If A is a skew symmetric matrix, B is positive definite, and S is 
symmetric then 

|A + B+iS[ ^0 

Proof. A + B + iS ^ since B is positive definite. If |A + B + iS| = then there 
are non-zero real u, v such that 



(A + B+iS)(u + iv) =0 
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Now A, B, S, u, v are real, so we can separate real and imaginary parts: 

Au + Bu - Sv = 
Av + Bv + Su = 

The inner product < r, s >= r t s satisfies < Ar, r >= since A is skew- 
symmetric. Thus 

< Bu, u> — < Sv, u > = 

< Bv,v > + < Su,v > = 

S is symmetric, so adding the last two equations yields 

< Bu, u> + <Bv,v> = 

However, B is positive definite and < Br, r > is positive unless r = 0. Thus 
u = v = 0, a contradiction. □ 

Corollary 19.35. If S, B are symmetric and either one is positive definite then |S +tB| 
is not zero. 



Proof. Take S = 0. If necessary, multiply by x. 
Lemma 19.36. 

' A B 

-B A 

If A is a skew-symmetric matrix then 
Proof. Assume all matrices are n by n. We compute 
22 n 



□ 



= |A + iB| • 




-iB| 




-A B 




A 


B 


-B -A 




-B 


A 



A B 

-B A 



I (A B W I il 



-tl I 

2(A + tB) 
2(A-tB: 



A 7 HI I 



The second part follows from the first. 



□ 



Corollary 19.37. If A is a skew-symmetric matrix, S is symmetric, and B is positive 
definite then 

A S + iB 
-S - iB A 



^0 



Proof. The determinant equals |2(A — B+iS)|-|2(A + B— iS)|, and is non-zero 
by LemmaUm □ 



In the next result we determine properties of eigenvalues. 
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Lemma 19.38. Suppose that A, B are positive definite and oc + (3i is an eigenvalue 
o/A + Bi. 

2. < a < p(A), < (3 < p(B), 6 /a < p(BA _1 ). 

2. If A is only symmetric then (3 is positive. 

3. IfB is only symmetric then a is positive. 

Proof. Let u + iv be an eigenvector of a + Si, where u, v are real. Then 1 
(A + Bi)(u + tv) = (a + 6i)(u + iv) 

Separating real and imaginary parts yields 

Au — Bv = au — Bv 
Bu + Av = av + 6u 

< x, y >= x l y is the usual inner product. The symmetry of A and B yields: 

< u, Au > — <u,Bv> = a<u / u> — |3 <u,v> 
<u / Bv> + <v,Av> = a<v,v> + |3 <u,v> 



We can now solve for a, and a similar calculation yields |3 

< u, Au > + < v, Av > 

a = p A 

< u,u > + < v,v > 

<u,Bu> + <v,Bv> 

6 = sC p(B) 

<u,u> + <v,v> 

|3^<u / Bu> + <v / Bv> g;p(BA _ 1) 

a < u, Au > + < v, Av > 

We use these formulas to establish the various parts of the lemma. □ 



19.5 Determinants 

We can construct elements of various NVd(C) using determinants. 

Corollary 19.39. If y e C \ 0, H is the upper half plane, Dt are n x n positive 
definite matrices and S is symmetric then 

d 

\yS + ^x k D k \eNV d ((yH) d ) 
l 



Thanks to math.sci.research and Tony O'Conner for this argument. 
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eigenvalues of A + Bx 



Figure 19.1: Location of roots of | - xl + A + Bi| if B > A. 



Proof. If yui, . . . , yod e yH where cr k = ot k + x|3 k then 



yS + ^x k D k {yoi,...,yo & ) = y n |S + ^ cr k D k | 



y 



| (S + ^ a k D k ) + iQ^ |3 k 



and this is non-zero by the lemma since 6 k > and so Y. PkD k is positive 
definite. □ 

Next we construct polynomials in NV<i(C d ) where 6 is not a half plane. 

Lemma 19.40. If A^, B k are positive definite and C is the first quadrant then 

d 



^x k (A k + iBi< 



€ NV d {G c 



Proof. If oc k + i(3 k € 6 then a k and 6 k are positive, and 



^Ja k +i|3 k )(A k +tB k ) = ^(a k A k -(3 k B k )+i^(a k B k + (3 k A k 



(19.5.1) 



Since ^f(oc k B k + (3 k A k ) is positive definite the result follows from 
Lemma Q23S □ 

Lemma 19.41. If A k is positive definite, A k > B k and e={a + bi|a>b >0} 
then 

d 



^x^Ak + iB^ 



S NV d (G d ) 



Proof. The hypothesis imply that 

a k A k - (3 k B k = A k (a k - 6 k ) + |3 k (A k - B k ) 
is positive definite, so the result follows from Lemma [19.351 and (|L9.5.1b . □ 
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Lemma 19.42. Suppose A and B are symmetric and f (x) = | — xl + A + Bi|. 
IfB > A > then f e NV^V) where D = {a + bt | a > b > 0}. 

Proof. Since B > A we know that p(BA~ : ) > 1. From the lemma the eigenval- 
ues <x + (3i satisfy |3 > a > 0. The roots of f are the negative eigenvalues of 
A + Bt, so they lie in the region of the first quadrant bounded by x = y and 
x = 0. See Figure IT911 

□ 

19.6 Linear transformations 

Properties of the Hadamard product in P depended on the identification of 
the Hadamard product as a coefficient: 

If f(x) 6 P pos and g e P then f(-xy) and g(y) G P 2 , and the 
Hadamard product is a coefficient of f (— xy )g(y). 

The reason why f (— xy) S P2 is that a — xy S P2 if and only if a > 0, and this 
follows by reversing x + ay. A similar argument works for half planes. 

Lemma 19.43. Suppose that G is a half plane with reversal constant p, and I is the 

ray p _1 E + . The Hadamard product 

determines a map 

P l xNV d (e d ) — >NV d (e d ) 

Proof. If r > then x + ry e NV 2 (C 2 ) so pxy Ire NV 2 (C 2 ) and hence xy — 
i-r/p) e NV 2 (C 2 ). Thus, if f (x) e P { (n) then f (xy) e NV 2 (C 2 ). Write f = 
^ aiX 1 and g — 9i(y) xl - The coefficient of z n in 

f (xz)g(z) = Y_ a i z " 9j (y) z ' 

y 

is in NVd(e) and equals a i9n.-i(y)x\ □ 

If we consider f (x + y ) we get 

Lemma 19.44. If 6 is a half plane then f(x) x Y. biy 1 i-> T. f (i) M b n _t/i! 
determines a map 

NV 2 {e 2 ) x NVi(e) — ► Nv 2 (e 2 ) 

Proof. Write f (x + y) = f (l ' (x) y The coefficient of y 11 in f(x + y) ^ bty 1 
is 

— b n _t. 

□ 
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Lemma 19.45. Suppose G is a half plane with reversal constant p. 

1. Ifi e NV d (G d ) then f + p 3 Xl 3 X2 f e NV d (e d ). 

2. The linear transformation g x f i— > g(p 3 Xl 3 X2 )f determines a map 

P pos x NV d (G d ) — ► NV d (G d ) 

Proof. The first part follows from Lemma 119.291 and f 3 X2 f 3 Xl (3 X2 f). 
The second part follows by induction. □ 

19.7 Analytic functions 

We define NV d (C) to be the uniform closure on compact subsets of NV d (C). 
Here are some exponential functions in NV d (C). 

Lemma 19.46. Suppose S C C is a cone. 

1. 1/(3 e e\0f/zCTie x/p e NVi[e). 

2. If p is the reversal constant then e pxv e A/l/jfC). 

Proof. Since (3 + x e NVi(C) we have ^1 + is in NVi(e), and hence 

e x /P g NVj(e). For the second part we know x + y e NV 2 (e 2 ), so 1 + pxy € 
NV2(C 2 ). Taking limits and following the first part finishes the proof. □ 

Proposition 19.47. If G is a half plane with reversal constant p then 

1. f i— > e pd *' d yf determines a map 

NV d (G d ) — -> A/\/ d (e d ) 

2. f (x) x g(x) i— > f (p3x)g(x) determines a map 

NV d (G d ) x A/\/ d (e d ) — » A/\/ d (e d ). 

Proof. We know that g + p3 Xl 3 yi g e NV d (C d ). Thus g h-> (l + p9 ^ ) n g 

maps NV d (C d ) to itself, since the homogeneous part of (l + — — ) n g 
equals g H . Consequently, taking limits shows that the homogeneous part of 
gps*! o Vl g e q ua i s gH^ anc j ^ n particular is not zero. The result now follows from 
Lemma ll9.9l and the observation that 

p3 x ■ 3 y = p3 Xl 3 yi H h p3 Xd 3 yd 

The second part relies on the identity 

e P9«3yg( x )f (y ) =f(p3 x )g( X ). 

y=0 
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By linearity we only need to check it for monomials: 



e p3x3 yx I y J = IEMxI . ( 9y )J y J 




( P 3 X )V 



□ 



Example 19.48. The infinite product cos(x) = 0k=o y- ~ 
shows that cos(yx) e NVi(yH). Taking derivatives shows 




) 



cos(yx) «/v — y sin(yx) and sin(yx) <^ y cos(yx) 
This gives us elements in NV2(yH): 

cos(yx) — yy sin(yx) and sin(yx) +yy cos(yx) 



19.8 Homogeneous polynomials 

Define homog d to be the set of homogeneous polynomials contained in HPd. 
If we don't want to specify d we write homog^ . By Lemma [l9.24l we can write 
f (x) G homog d as a - g(x) where g(x) has all positive coefficients. If d = 2 then 
f(x,y) e homog d is of the form a • G[x,y) where G is the homogenization of a 
polynomial in P pos . In general, f e homog d is a multiple of the homogeniza- 
tion of a polynomial in P^Lv since Pd-i = rHPd-i (Theorem l20.3|l . 

The first lemma shows that there is nothing special about the upper half 
plane in the definition of homog d ; any half plane through the origin will do. 

Lemma 19.49. homog d = C\ y ec\o NV d ((yH) d ) 

Proof. If f e homog d has degree n and cr G H d then f(ycr) = y n f(cr) ^ so 
f g NV d ((yH) d ). 

Conversely if f (x) has degree n and is in the intersection, cr e H d and y ^ 
then f (ycr) ^ 0. But f (ycr) is a polynomial in y, so the solutions to f (ycr) = 
must all be y = 0. Thus f(ycr) is divisible by y n , and f (ycr) = y n f (cr). Since 
this holds for all cr e H d we see that f (yx) = y n f (x), so f is homogeneous. □ 

There is a simple determinant construction for polynomials in homog d , 
but not all such polynomials can be so realized. 

d 



Corollary 19.50. If all Di are positive definite then | } G homog d . 




Next, we construct homogeneous polynomials from elementary symmet- 
ric functions. Fix a positive integer n, and let crt (y \, . . . , y n ) be the i'th elemen- 
tary symmetric function of t)i,. . .,y n - If I is an index set then 




Proof. This follows from Corollary |19.39l 



□ 



CTi(x) = Y[ ffi("U'-vV) 
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Lemma 19.51. For positive integers n and m we have that ^ Oj(x) e homog^. 

|I|=m 

Proof. Since uz-1 e HP 2 

n 

J^xyz- 1) = ^ cT k (x U/ ... / x i , n )(-zr- k e HP, 
i=l k 

and thus 

n^a k (x j4 HnH-z)— k ) eHP, 

j=l ^ k ' 

The coefficient of z m is ± ^|,| =m Oi(x). This is in homog, since all monomials 
have degree m. □ 

Now we have the Schur-Szego theorem for homogeneous polynomials. 

Lemma 19.52. If f = Y_ a i 9 — T. ^i x '' deg(f) = deg(g), and f, g € homog d 

then 

Y ajb/Jlx 7 e homog d 
Proof. If f has degree n then f (3x)g(x + y) is in homog, and equals 
Y_ a: dx 1 bj (x + y) J ^^Qjbillx 1 

|l|=|j|=n 

since (ax'Hx + y) 1 = < ° , 1 ^ ' □ 
y \i!y J I = J 

Lemma 19.53. IfY_ a,x' e HP d fen 21 a,cr,(x) I! e HP d . 

Proof. Note that 

(ax) l [(x + xi)---(x + x n )]| x=0 = a t (x)i! 

and thus 

(Y_ a, 9x') (H fl (Xi + x^)) = ^ a,<7,(x) I!. 

\ i=1 k=1 / x=0 

□ 

Corollary 19.54. The linear transformation x 1 Oi(x)I! maps HP, — ► HP, and 
homog^ — > homog^. 
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19.9 Mobius transformations 

Suppose Mz = ^fij is a Mobius transformation where |M| = ad — be ^ 0. 
If f (xi, . . .,x T ) is homogeneous of degree n then we say that f is M-invariant if 
(Mf )(x) = |M.| n f (x) for all Mobius transformations M where |M| ^ and 

(MfHxj x T ) = [(cx! + d) • • • (cx r + d)] n ■ f(^| ^4^) 

L J v cxi + d cx r + d ' 

We will see (Lemma 119.551 1 that there are no M-invariant polynomials in 
homog d , but we can find polynomials f (x, y) such that 

1. f is M-invariant. 

2. f(x,-y) e homog d 

We will call such polynomials Ruelle polynomials 1 150 1. It is easy to show that 
a polynomial is M-invariant; the difficulty is showing that it is homog d . 

Lemma 19.55. If f € HP n then there is a Mobius transformation M so that Mf ^ 

Proof. Choose on so that f (ai, . . . , oc n ) = 0, and choose |3 so that (3 + on is in 
the upper half plane for all i. If Mz = z — (3 then the roots of 

Mf(xi,...,x n ) = f(x 1 -(3,...,x n -(3) 

are |3 + ai, . . . , (3 + oc n which are all in the upper half plane, so Mf ^ HP n . □ 

We now give several constructions of M-invariant polynomials. They all 
depend on the fact that x — y is M-invariant: 



M(x-y) = (cx+ d)(cy + d) 



ax + b ay 



cx + d cy 



= |M|(x-y] 



Construction 19.56. If (a^ ) = A is an r by r matrix and X = diag(xi, . . . , x r ), 
Y = diag(yi,. . .,y r ) then we define 

f(x,y) = det(AX — YA) = det(ay(xi--yj)). 

f (x, y) is M-invariant since 

Mf = detfciijMfxi - Vi )) = det(|M|a y (xi -y,)) = |M| r f(X,Y). 

Construction 19.57. With the same setup as above we let 

g(x,y) =per(AX-YA) 

where per is the permanent. The same argument shows g (x, y) is M-invariant. 
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If we expand the determinant or the permanent we find that these two 
constructions are special cases of 

Construction 19.58. If <x a e C for each permutation a then define 

T 

Mx,y) = Y- a ^ ]^[(xi -u ai ). 

cresym(r) i=l 

This is clearly M-invariant. Ruelle 1 150] proved that all Ruelle polynomials 
can be written in this form, but the oi a are not unique. 

Example 19.59. If } is the r by r all 1 matrix then Cauchy's determinant for- 
mula [108] gives 

f(x,y) = |JX-YJ| = |(x i -y j )| = n(x i -x j ) • 1^-^) 

i<j i<j 

T 

<rGsym(r) 1 

Note that f (X, — Y) is not in HP2 r since it contains xi — Xj factors. However, we 
will see later that 

r 

per(XJ-IY) = Y. n^-y^ 

cr^sijmfr) 1 

is a Ruelle polynomial. We call this the Grace polynomial, denoted Grace,-. 
For instance, 

Grace 2 = (x 1 -yi)(x 2 -y 2 ) + (*i -U2KX2 -yi) 
= 2xix 2 - (xi + x 2 )(yi + y 2 ) + 2yiy 2 

= 2o 2 (xi,x 2 )o'o(yi,y2) - ffi(xi,x 2 )ff 1 (y 1 ,y 2 ) + 2(j (x 1 ,x 2 )(j 2 (y 1 ,y 2 ) 
The next lemma is Ruelle's original definition of a Ruelle polynomial. 

Lemma 19.60 (Ruelle). f(x,y) is a Ruelle polynomial iff whenever there is a circle 
separating a = {o\, . . ., a n )from t = . . .,T m ) then f (a,x) ^0. 

Proof. Choose a Mobius transformation such that Mcr is in the upper half 
plane and M/r is in the lower half plane. Since f(Mx, — My) is in homog d 
we have that f (Mcr, — Mt) 7^ 0. The converse is harder, and can be found in 

nsa. □ 

Lemma 19.61 (Ruelle). If A is unitary then f (x,y) = |AX — YA| is a Ruelle poly- 
nomial. 
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Proof. We have seen that f is M-invariant. If A is unitary then A 1 = A* so 

f (x,y) = | AX - YA| = |A| |X - A*YA| 
If the rows of A are Aj, . . . , A T then 

f (x, y) = | A| det(x + ^ Vi A* A^j 

Since all A? At are positive semi-definite the determinant is in HP 2r . □ 

Example 19.62. In this example we give the Ruelle polynomial that is con- 
structed using the general three by three orthogonal matrix [11 J where |3 — 
a 2 + b 2 + c 2 + d 2 . 



1 



b 2 -c 2 -d 2 2(bc + ad) 2(bd-ac) 



A=- 2(bc-ad) a 2 - b 2 + c 2 - d 2 2(ab + cd) 

13 \ 2(ac + bd) 2(cd-ab) a 2 - b 2 - c 2 + d 2 

If we expand the determinant we get the following permutation represen- 
tation: 

(y 3 + xi)(y 2 + x 2 )(yi +x 3 ) + 
4(ac + bd) 2 (v2+x 1 ){y 3 +x 2 ){vi+x 3 ) + 
4(ac-bd) 2 (y 3 +x 2 )(ij2 +x 3 ) + 
4(b 2 -c 2 )(a 2 -d 2 ) (yi+x 1 ){y 3 +x 2 ){v2+x 3 ) + 
4(a 2 -b 2 )(d 2 -c 2 ) (y 2 +xi)(yi +x 2 )(y 3 +x 3 ) + 
(a 2 - b 2 + c 2 - d 2 ) 2 (y a + xi)(y 2 + x 2 )(y 3 + x 3 ) 



In order to show that the Grace polynomial is a Ruelle polynomial we need 
to recall Grace's theorem: 

Theorem 19.63 (Grace). If f = Y.o a i xX , 9 = ^Jo ^H* 1 ' an ^ the roots of f are 
separated by a circle from the roots of g then 

n 

J^aibn-iC-l)*!! (n-i)!^0 



Corollary 19.64. The Grace polynomial is a Ruelle polynomial. 

Proof. We know the Grace polynomial is M-invariant; we now show it satisfies 
the second condition. Suppose that x\, . . . , x n and yi, . . . ,y n are in the upper 
half plane. Grace's theorem applied to 0( x — x an d 0( x + yi) shows that 

^(-l)V t (xi, . . .,Xn) • ffn-i(yi, ■ • ./UnX-D* i! (n - i)! 
= Y o , i(xi / .../X n )g n ,-i(y 1/ ... / y u )i! (n-i) 



Comparing monomials shows that this is exactly Grace r (x, — y), and it is 
non-zero for substitutions in the upper half plane. □ 
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Here's a simple one-variable corollary. 
Corollary 19.65. Ifa^M € Rthen 

crSsijm(n) i 

Proof. Replace yi by bt and xt by x + at in the Grace polynomial. □ 

The conjecture below generalizes Grace's theorem: 

Conjecture 19.66. Ifn is a positive integer, cri(x k ) the i'th elementary symmetric 
polynomial on x kj i, . . . ,x k/T1 and a z (x) = nk°ik( x k) ^en 

Y_ o-i(x)I! e ftomog* 

|i|=n 

19.10 Properties of polynomials with zeros in a strip 

Assume that S is the strip {z — 1 < Re(z) < 1}. We will consider properties of 
NV a (C \ §). If f e NVj(C \ §) then all roots of f lie in S. Since § is convex we 
have that 

NVj(C \ S) is closed under differentiation. 
The reverse of x — a has root 1/a, so 

NVi(C \ §) is not closed under reversal. 
A non-trivial property is this interlacing result: 

Lemma 19.67. If S is a vertical strip and f € NV\ (C \ §) then for all real a 

f + iaf g MA(C\S) 
The proof follows from the next lemma. 
Lemma 19.68. Suppose that f = Yli x ~ T k)/ and define 



f + \<xY_ 



ak- 



f(x) 



X-T k 
k 



where ael and all a k are non-negative. If a + bi is a root of g then 
min9i(r k ) ^ a < max$R(r k ) 

k k 
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Proof. Let r k = Sk + ttk- Dividing by f yields 
1+iaV a k — — — - 

± — n -4- hi — 



a + bi — s k — itv 

k 

La - xb - s k + itic 
aic 1 — —r n? = y a 
a + bi-Sk-ttk 2 

k 



Taking the real part yields 

a - s k 



,a + bi-s k -ttkl 2 

Thus a can't be less than all s k , nor greater than all s k - □ 

Corollary 19.69. Ifi e P then the map g f (\D)g determines a linear transfor- 
mation 

A/Vi(C\S) — -> A/V1(C\S) 
Corollary 19.70. I/g e M/ X (C \ §) then 

g + g" + g (4) /2! + g (6) /3! + ••• 6 W1/ 1 (C\§) 
Proo/. Apply the previous corollary to f = e~ x2 . □ 

The polynomial xy — 1 is in NV 2 (C \ 8) since |x| > 1 if x S. If x,y ^ § 
then bey | > 1, and hence xy — 1 ^ 0. This observation will be generalized in 
the next section 

19.11 Polynomials non-vanishing on the open unit ball 

Suppose that Ad is the open unit ball in C d . That is, 

A d = {z e C d | \z\ < 1} 

We will construct some simple non-trivial elements of NVd(D) where D c 
C, and use them to show that NVd(A d ) has non-trivial elements. Recall that 
NV d CD) contains trivial products of the form 

d 

nn^ ^ 

t=i j 

where ay ^ D. The next lemma constructs elements that are not products. 

Lemma 19.71. Suppose that the ball B = {z | z — cr| < r} is contained in C d \ D. 
If (j — (cTi, . . ., cr d ) and < b < r d then 

(x-o-i)---(x-<T d )-be A/V d (D) 
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Proof. If (Ti,...,T d ) g B then |xi - o- t | ^ r, so |(ti - oi) • • • (x d - a d )| ^ r d . 
Since < b < r d it follows that 



(Ti-ffi)---(T d -<r d )-b€NV d (D) 



□ 



Example 19.72. If H is the upper half plane, then the ball {z | \z + i| < 1} is 
contained in C d \ H d . It follows that 

(xi +i) • •■ (x d +i) -1 G HP d 

A similar argument shows that 

(xi+l)---(x d + l)-l€5C d 

Here's a small extension to NV d (A d ). 

Lemma 19.73. iff (x) has all its zeros in the open unit disk then 

f(xi---x d ) g yw d (A d ) 

Proof. Use the lemma with B = A d; and multiply many of the polynomials 
together. □ 

It is easy to see that xy — 1/2 G NV2(C 2 \ A x A) since if neither x nor y is in 
A x A then both have absolute value at least one, so their product has absolute 
value at least one. There is a simple geometric condition for a multiaffine 
polynomial to belong to NV 2 (C 2 \ A 2 ). 

Lemma 19.74. Suppose thatf(x,y) — a + bx + cy + dxy, and let M = ( £ d ). 

2. f e NV 2 (C 2 \A 2 ) <^=> M: C 2 \A 2 — ► A 2 
2. feWl/ 2 (C 2 \A 2 ) M: C 2 \A 2 — > A 2 

Proof. Assume that \y\ ^ 1. If f (x,y) — we must show that |x| < 1. Solving 
for x 

b + dy 

Thus |x| < 1 if and only if M(t))| < 1. The second case is similar. □ 

Recall the result (p. 1616b that if < ai < • • • < a n then Y. Q t xV has all its 
roots in the open unit ball. Here's a conjecture in two variables 

Conjecture 19.75. IfO < qi < • • • < a n then 

^(ai + ajlxVeN^C^A 2 ) 
We can establish the multiaffine case of this conjecture. 
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Lemma 19.76. I/O < a < b then 2a + (a + b)(x + y) +2bxy e NV 2 (C 2 \A 2 ). 

Proof. This is a question about Mobius transformations. We take three points 
on the unit circle, and find the unique circle containing their image. We check 
that M(oo) lies in this circle, and that this circle lies in the unit circle. 

M(l) = |^ M(-l)=l 
3b + a 

. . (2 + i)a + ib a + b 

a + (1 + 2i)b 2b 

2(a + b) b-a 
center = — — radius 



a + 3b a + 3b 

Since the center is on the real line, it follows that the image is the circle with 
diameter M.(l) M(— 1). Thus the image lies in the closed unit circle. □ 

The following lemma constructs a stable polynomial from a polynomial in 
NVi(C\A). 

Lemma 19.77. J/0 < q < • • • < a n then ^J 1 a t (x + - ^) n ~ l & 

Proof. The Mobius transformation z J^ 5 maps the right half plane to C \ A. 
It follows that if f (x) e NVj (C \ A) then f ( fl + x) (1 - x) ) e JCi . If we let f (x) = 
£ aix 1 then f e NVi (C \ A). Multiplying by (1 - x) n yields the result. □ 

19.12 Polynomials non-vanishing in a sector 

We begin with a simple construction of polynomials with no roots in a sector. 
We claim that if f and g have positive leading coefficient then 

f£P pos (n) & geF os (m) => f(x) + g(y) e rNV 2 (S 2n x § 2m ) 
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To see this, we first claim that the image of §2 n under f is the open right half 
plane. If f = n( x + r where r t > 0, and cr e B n , then n + cr e S 2n / so f (cr) 
is the product of n points whose arguments are all less than n/(2n), Similarly, 
if T e §2m then g(x) is also in the right half plane, so their sum is as well. In 
particular, their sum is not zero. 
For example, 

x-+u m erNV 2 (S 2n xS 2m ) 
This construction easily generalizes. 

Lemma 19.78. Suppose that n\, ... ,n d are positive integers. IfalliiM <= P pos (n-t) 
have positive leading coefficient then 



fi(xi) + --- + f d (x d ) e rNV d [S 2ni x ••• x S 2n J 



Chapter 

20 



Upper half plane polynomials 



There is a natural subset of polynomials with complex coefficients that has 
properties similar to P: those whose roots lie in the lower half plane. However, 
we call them upper half plane polynomials because the generalization to more 
variables uses the fact that they are non-zero in the upper half plane. For a 
quick summary with proofs see [59]. 

We first recount properties that follow from the general theory of Chap- 
ter [T9j and then investigate polynomials in one variable. 



20.1 Properties of upper half plane polynomials 

We apply the general results of Chapter[l9]to derive results about polynomials 
non-vanishing in the upper half plane. If UHP is the upper half plane then 
U d (C) = NV d (UHP d ). The reversal constant is -1, and UHP/UHP = C \ 

(-oo,0). If f,g G U d (C) then we write f ^- g if f + zg G U d +i(C). From 
Chapter[l9]we have 

Lemma 20.1. 

1. U d (C) is closed under differentiation. 

2. Iff G u d (C) then f ^- di/dxi 

3. //L^fi(y) G U d+1 (C) thenUv) G U d (C) andh ^- f l+1 . 

4. fg G U d (C) if and only iff G U d (C) and g G U d (C). 

5. Iff «— g then g f. 

6. Iff <— g <— h t/zen f — h <— g. 

7. If Sis symmetric and all Di are positive definite then \S + Y.\M^x\ G U d (C). 
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8. The Hadamard product Y_ di*- 1 x Y. ^iiy) xX l— * Y. iifn-ity)* 1 determines a 
mapP pos x C7 d (C) — ► u d {C). 

9. f(x) x g(x) i ► f(— 3x)g(x) determines a map 

U d (C) X C7 d (C) — > £7 d (C). 

20. The following are equivalent 

a) f(x) e C7 d (C) 

b) f(a + tb) e Ui(C) /or a € M d ,b > 0. 

Proof. The only modification is that we do not need homogeneity in the last 
one. Following the proof of Lemma [19.181 we see (1) implies (2). Conversely 
consider oi, . . . , cr d in UHP. Since 1 is in the closure of UHP we can find an a 
and positive a,b such (ctj, . . ., cr d ) = a+ ocb. Thus, f(cii,. . ., cr d ] = f (a + ba) ^ 
0. □ 

There are two reversals in U d (C), a single variable reversal with a minus 
sign, and a full reversal with no signs. 

Lemma 20.2. 

1. i/Xx^ily) e u d {C) has degree n then ^i-x^iiiy) e u d [C). 

2. If Xi has degree e t then xf ■ ■ ■x^ d f(l/xi,...,l/x d ) S U d (C). 

Proof. Only the second one requires proof. If ff^ 6 UHP then 1/oT G UHP. 

Thus 

f(lM,...,l/cr d ) =f(l/ar,...,l/al) ^0. 

□ 

Theorem 20.3. U d = P d . 

Proo/ We first show u d C P d . If f(x) G U d then define g e (x) = f(x + e(xj + 
• • • + x d ) ) where e > 0. We may assume that f H has all non-negative terms, so 
gj^ 1 has all positive terms. Define a = (0, r 2 , . . . , r d ) and b = (t, 0, . . . , 0). Since 
U d is closed g e (a + bt) G Ui. Since Uj = P, we see that g e satisfies substitution, 
and therefore g e G U d . The conclusion follows since lim e ^o+ 9e = f- 

Next we show u d C U d which implies P d = u d . If f G U d and is homo- 
geneous then f(a + bt) G P pos for all a,b > 0, so f G U d by Lemma 12011 If f 
is not homogeneous then choose c G K d so that f (x + c) has all positive coef- 
ficients. Since f (x + c) G P d we can homogenize it y n f (x/y) = F(x,y). Since 
F(x,y) G U d+ i and is homogeneous we know from the previous paragraph 
that F(x,y ) G U d+ i, and thus f (x + c) = F(x, 1) G U d . Now c is real so it follows 
thatf(x)GU d . □ 

Lemma 20.4. J/f(x,y 2 ) G C7 d theni[x,y) G P d . 



1: need real and im 
parts in Pd 
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Proof. If cr, ffi. e UHP then we can write a — t 2 where t g UHP. Thus 

f(a 1 ,...,cr d ,cr) = f(a a , . . ., cr d ,T 2 ) ^0 
which proves the lemma. 

Of course the converse is false - consider x + 1 and x 2 + 1. 



□ 



20.2 Constructions from determinants 

We can construct polynomials in HP d using skew-symmetric matrices. 
Lemma 20.5. If A is skew symmetric and B is positive definite then 



xA B 
— B y A 



eP 2 



Proof. If b = B~ 1/2 = b 1 then 



b 0\ /xA B \ / b 
bJ l-B yAJ U b 



xbAb I 
-I y bAb 



xC I 
-I yC 



where bAb = C is skew symmetric. We can find an orthogonal matrix and 
D an anti-diagonal matrix with positive anti-diagonal (dt) such that OCO* = 

-D o)- Clearly 







xC I 

-I xC 



0' 



( . xD I . \ 

xD . . I 

—I yD 

-I -yD . 



where the dot is a zero matrix. By (|19.4.1b this determinant is n^didn-ixy — 
1) which is in P2. 

Example 20.6. If A is the skew-symmetric n by n matrix that is all 1 above the 
diagonal, and all —1 below, then 



xA I 
-I A 



-x 



n 



We already knew that this is in P since it's just the even part of (x — l) n . 



□ 



The following corollary was proved by a different method in [62 J. 



Corollary 20.7. Suppose A is skew-symmetric and B is positive definite. The deter- 
fxA B\ 

minant and pfaffian of I g .J have alternating coefficients. 
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Proof. Substituting y = 1 in Lemma 120,51 shows that all the roots are positive, 
and hence the coefficients alternate. Since the pfaffian is the square root of the 
determinant all the roots are negative, and so the coefficients again alternate. 

□ 

We can construct polynomials in d variables of even total degree. 
Corollary 20.8. If A is skew-symmetric, and Bi, . . . , are positive definite then 



f(x) = 







(o 







-Bi 



Bi 




+ 



B d 
-B d 



e HP d 



f (x) is a square, and all non-zero monomials off (x) have even total degree. 

Proof. If we substitute ock + 6ki for Xk where all 8 k are positive, then the 
determinant equals 



A 

- Y_ «kB k 



Z a k B k 
A 





-LPkB! 



LPkBk 




and this matrix is non-zero by Lemma [l9.34l since Y. PkBk is positive definite. 
Since the matrix is anti-symmetric f (x) is a square. Finally, using LemmaQjOl 



f(-x) = 



A 

L x kB k 



L x kBk 

A 



-A 
L x kB k 



L x kBk 

-A 



= f(x) 



□ 



Example 20.9. If d = 1, Bi is the identity, and A is the matrix of the last 
example then 



A 
A 



\A + xxl\ = Y_( 



A=0 



i=0 



In the last corollary the matrix had real coefficients and the polynomial 
was a square. If we allow complex coefficients then we can take the square 
root: 

Corollary 20.10. If A is skew-symmetric and all Bk are positive definite then 

d 

g(x] = |A + i ^XiBil e rHP d 
l 

and all monomials have even degree. 
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Proof. The determinant in Corollary 120.81 equals 

d d 

|A + i Y XjBj||A-i Y Xj Bj| 
l l 

If C denotes the sum then 

|A + xC| = |(A + iC)*! = | — A + iC| = |A — iC| 

Thus both factors are equal and since the conjugate of |A + iC| is |A — iC| they 
both have real coefficients. This also shows that g(— x) = g(x), and there fore 
all terms have even degree. 

□ 

Lemma 20.11. Suppose that Di, D 2 , D3 are positive definite matrices, and S is sym- 
metric. If 

f(x,y) HxDj+uDz + S+tDal 

then f(x,u) e U 2 . 

Proof. The homogeneous part of f is |xDi + uD 2 which has all positive coeffi- 
cients by Lemma [9.51 For any aeK 

f(x,a) = |xDi + (aD 2 + S) +iD 3 | 

and by this is in J since a D2 + S is symmetric. □ 

20.3 SL 2 (K[x]) 

In this section we characterize the set of matrices of SLjCMfx]) whose entries 
interlace. We define 

Definition 20.12. 

SL 2 (R[x]) P = {(9 f) I gh-fk = landf 3- g ^- k,f ^ h ^ k} 
We begin with some elementary properties. 
Lemma 20.13. Suppose G SL2(M[x])p is not constant. 

1. deg(g) + deg(h) = deg(f) + deg(k). 

2. If sign(r) is the sign of the leading coefficient ofr then sign(g) • sign(h) = 
sign(f) • sign(k). 

3. (9 I y 1 ^SL 2 (R[x]) P . 

Proof. The first two follow from the fact that gh — fk is a constant, so the 
leading coefficients of gh and fk must cancel. The determinant of the inverse 
is 1, but all the interlacings go in the opposite direction. □ 
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Remark 20.14. Consider some necessary conditions for the matrix M = ( g * ) 
to belong to SL 2 (MM) P . 

1. M has determinant one. 

2. Degrees of adjacent entries differ by at most one. 

3. All entries are in P. 

4. Either f <-^- g or f h or g <-^- k or h k. 

The matrix ( * x 2^ 1 ) satisfies all conditions except (1). If (2) fails then not 
all adjacent polynomials can interlace. It is possible for only condition (3) to 
fail - consider ( * x * +1 ) . In addition, only condition (4) can fail 

x 2 + 3x x 2 + 3x + 2 
x 2 + 3x - \ x 2 + 3x + \ 

All the entries are in P, but no two entries interlace. 

Lemma 20.15. IfM— ( S has determinant 1 and three out four interlacings then 
M G SL 2 (KM)p. 

Proof. Using the first part of the next lemma we can multiply by certain of 
the first three matrices of the lemma to assure that all leading coefficients are 
positive. The interlacings were preserved, so we know that the degrees must 
be ( n + b n +^ b ) where a, b ^ 0. 

Suppose that we do not know that g — ► f . Since h — ► f we know that h 
sign interlaces f . Now fk — gh = 1, so g has the same sign at the roots of f as h 
does, so g sign interlaces f. Since deg(g) ^ deg(f) it follows that g — ► f. The 
remaining cases are similar. □ 

We now show that certain matrices preserve SL 2 (MM)p- 

Lemma 20.16. Suppose a > 0. Multiplying on the right or left by any of these 
matrices maps SL 2 (MM)p to itself. 

/ n ( -1 \ (lax+b\ ( 1 0\ 
\1 ) \-10) \ -\) V0 1 ) V-ax-blJ 

Proof. The first three follow easily from the fact that 

, U , U U r U 

f <— g & -f < g -q< — f g < f. 

If (o f) G SL 2 (RM)p then 

g A /l ax + b\ fg (ax + b)g + f 
k hj^O 1 J [k (ax + b)k + h 

Now if we have r <-^- s then r + ys G U 2 . Since a ^ we may substitute 

r + (y + ax + b)s = r + (ax + b)s + ys =4> r + (ax + b)s <-^- s. 
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This shows that we have three out of the four interlacings, and since the de- 
terminant is one, the conclusion follows from Lemma 120.161 Multiplying on 
the other side is similar. 
The last one follows from 

/ 1 0\ _ [0 -1\ (1 ax + b\ / l\ 
^-ax-b ) [O 1 ) \-l OJ 

□ 

Proposition 20.17. SL 2 (R[x])p is generated by 

[0 —1\ fc \ fl ax + b 
\l Q ) \Q l/c) [O 1 

where a)0,bel,c^0. 

Proof. The degree of M = is deg(g) + deg(h). If M has degree at 

least two then we show that there are matrices A, B, Mi such that M = 
AMiB where A, B are certain products of the first two generators, and Mi £ 
SL 2 (IR[x])p has lower degree. 

We can multiply on the left or right by the first two generators so that the 
leading coefficients in the top row are positive. If the matrix is ( ? t ) then 

f g, and since they have positive leading coefficients f < — g. Thus we 
can write f = (ax + b)g — s where a ^ 0, g < s, and s has positive leading 
coefficient. We write 

(q f \ (q (ax + b)g - s\ (q -s \ (\ ax + b\ 

where hi has positive leading coefficient, and h = (ax + b)k — hi. We show 
that ( k -hj ) € SL 2 (E[x]) P/ and by Lemma 120151 we need to show that k<h 1 . 

Now deg(g) + deg(hi) = deg(k) + deg(s), and deg(g) = deg(s) + 1 so we 
have deg(hi) = deg(k) — 1. From — ghi + sk = 1 we see that hi and g have the 
same signs on the roots of k. Since g alternates sign on the roots of k so does 
hi. From deg(hi) < deg(k) it follows that k < hi. 

If we reduce a matrix to one of degree one, then after multiplying by ap- 
propriate generators it has the form ( g °'i/ c b ^ , which is easily seen to be a 
product of the generators. 

□ 

Corollary 20.18. SL2(M[x])p is closed under multi-plication. 

Corollary 20.19. € SL 2 (K[x]) P then f + y g + zh + yzk e P 3 . 

Proof. It is clear that each of the first two generators of SL 2 (M[x] )p determines 
a polynomial in P3. The last generator determines the polynomial ax+b+y+z 
which in is P3 since a is non-negative. 

To complete the proof we verify that multiplication by each of the gen- 
erators preserves the property of being in P3. We start with a polynomial 
F = f + yg+zh + yzk e P 3 . There are three cases: 
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( ° g 1 ) This matrix transforms ( g ^ ) to ( g k f h ) which corresponds to 
-k - yh + zf + yzg = -(k + yh + f(-z) +y{-z)g 
which equals — zF(x,y, — 1/z) e P3. 

(0 i/c) This transforms F to c(f + yg + (z/c 2 )h + y(z/c 2 )k) = F(x,y,z/c 2 ) 
which is in P3. 

( ax i +b ) This transforms F to 

(ax + b)g + f + yg + z[(ax + b)k + h] + yzk = F(x, ax + b + y,z) 

which is in P 3 . 



Corollary 20.20. SL 2 (IR[x])p acts on polynomials f(x) + g(x)y + h(x)z+k(x)yz e 
P 3 by matrix multiplication. 

20.4 Matrices of nearly quadratic polynomials 

In this section we consider matrices formed from polynomials in U3 that have 
degree at most 2 in y and z. We show how to construct such matrices of arbi- 
trarily large degree and constant determinant. 

Definition 20.21. M 3 = {(fij)o<i,j<2 I L?,j=o f « M ^ e u s} 

Many operations on F(x,y,z) = Y. fijU x 2s' corespond to matrix multiplica- 
tion. We only consider U3, but the arguments apply to all U^. Let 



Example 20.22. Suppose we replace y by cy, where c > 0. Since F(x, cy, z) e 
U3 the new matrix is in M 3 and equals 



□ 





If we replace z by cz then we multiply on the left 




c 



M 
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Example 20.23. Next we consider F(x,y + a,z) which is in U3 for all a. The 
corresponding matrix equals 

'fro + af 10 + a 2 f 20 f 10 + 2a f 20 ho 
f i + af 11 + a 2 f 2 i f 11 + 2a f 2 i f 21 I = M. 
y f 02 + af 12 + a 2 f 22 f 12 

If we consider F(x, y , z + a) then we multiply on the left by: 






Example 20.24. If a is positive then F(x,y + ax, z) e U3 and F(x,y, z+ ax) e U3. 
These correspond to multiplying on the right by 



or on the left by 



Example 20.25. If we reverse with respect to y then we get 

/. . 1> 
= M . -1 

V ■ 

If we reverse with respect to z then we multiply on the left. 

Example 20.26. In order to get started we need some simple matrices. Since 





(y + z) 2 e U3 the following matrix is in M 3 



and has determinant —2. 



Example 20.27. Finally, we show how to combine these matrices to make ma- 
trices in M3 with determinant —2. Let 




r 



C = x 1 . A = 



The matrix BC(AC) n has degree 2n + 2. For instance, BC(AC) 1 equals 



x 4 -2x 2 + l 2x 3 -2x x 2 
2x 3 - 2x 4x 2 - 2 2x 
x 2 2x 1 
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20.5 J, a subset of Ui(C) 

We now consider polynomials in Ui (C) with positive leading coefficients and 
strict interlacing. 

Definition 20.28. 

3 = {f + gx | f, g e P have positive leading coefficients and f < g} 
jvos = {f -f gx | f 7 g g pP os have positive leading coefficients and f < g} 

3 — {f + gx | f, g e P have positive leading coefficients and f < g} 

3 = the uniform closure of 3 

If h(x) = Y. a i xl then we define h«n = ^IKfcuJx 1 and h/j — Y.3(a.i}x l . 
With this notation, the interlacing condition of the definition of 3 is that 
run < h-3 where h = f + gx. 

Remark 20.29. There are two reasons we only consider <, and not also <C . The 
first is that if we consider a product Yl( x + °i) — f M + la M where (1, £ C 
then the degree of f (x) is greater than the degree of g(x). 

Second, if the degrees of the real and imaginary polynomials are equal then 
the leading coefficient isn't positive nor even real; it's complex. However, this 
is not a serious problem, for we can multiply it to have the correct form: 

Lemma 20.30. Iff <C g, a > is the leading coefficient off and b > is the leading 
coefficient of g then 

[a - bi)(f + ig) e 3 
Proof. The leading coefficient of (a — bx) (f + ig) is a 2 + b 2 . Now 

(a-bx)(f + ig) = af + bg + i(ag - bf ) 

fa bWA_/af + bg\ 
\-b aj\g) \ag-bfj 

The determinant is a 2 + b 2 , and a is positive, so the interlacing on the right 
hand side is strict (Corollary l3.8b . 

It remains to determine the sign of the leading coefficient of ag — bf . Write 

f = Co H h Cn-iX 11 " 1 + QX n 

g = d + • • • + dn-ix^ 1 + b x n 

Since f g we know that | \ | > and this is the leading coefficient of 
ag-bf. ' □ 

Remark 20.31. If F = f + xg then F has no real roots. If r € K were a root of F 
then f(r) + ig(r) = 0, which implies that f(r) = g[r) = 0. This contradicts the 
hypothesis that f and g have no roots in common. 

If f S P pos then there are further restrictions on the location of the roots. 
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Lemma 20.32. If f, g e pv° s / f < g, and f, g have positive leading coefficients then 
the roots off + ig have negative real part and negative imaginary part. 

Proof. We know the imaginary part is negative. If f = Ok( x — r k) an d f (ct) + 
ig(a) —0 then there are positive such that 

^ aj (f/(x-T,)) = g(x) 
] 

]^[(a-r k )+i^Qj— -[](a-r k ) =0 

k j > k 

H-x£_a j /(a.-r j ) =0 

j 

Taking the imaginary part yields 



a — Ti 



Since the a, 's are positive and the Tj 's are negative, it follows that the real part 
is negative. □ 

It's easy to construct polynomials in 3 from polynomials in P: 

Lemma 20.33. Suppose that f e P, a is in the upper half plane and a is positive. 
Then 

1. f(x+a) G 3. 

r 1 

2. f (x + ait) dt e J. 



■1 

f (x + ait) dt e J. 

o 

4. i n f(-ix) eJif-fe P pos (n). 

5. If g(x) € 3 f/zen g(ax) e 3 for positive a. 

6. Iff e P /ias flZZ distinct roots then f (x) + if'(x) S 3. 

Proof. The roots of f (x + cr) all have imaginary part — 3(a) which is negative. 
Corollary 122.301 shows that all the roots of the integral have imaginary part 
equal to —a/2, and so the integral is in 3. 

Iff = x n + ax ™-i + ... theni n f(-ix) =x n +iax n - 1 + --- Since f(x) e P pos , 
all the roots of f(— ix) lie in the lower half plane. Thus the roots are in the 
correct location, and the leading coefficients are positive. 

The remaining ones are obvious. □ 
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A useful property of 3 is the following, which follows from the fact that 
3 C Ui(C), and the leading coefficient of the product is positive. 

Lemma 20.34. 3 and 3 are closed under multiplication. 

Remark 20.35. We can also prove that 3 is closed under multiplication using 
properties of matrices. It suffices to show that if f + gi and h+ ki are in 3, then 
so is their product (fh— gk) + (fk+ gh)i. Notice that both fh— gk and fk+ gh 
have positive leading coefficients, and that the degree of f h — gk is one more 
than the degree of fk + gh. If we write these terms as a matrix product 

/fh-gk\ fh -k\ /f\ 
^fk+gh^ [k hj{gj 

then the conclusion follows from Lemma \3. 5 71 
Here is a simple consequence. 

Corollary 20.36. Ifi < g and n is a positive integer then 
If u = 2m 

Y_ f 2 Vm - 2k (2m\ ( _ 1)k < £ ^XgM-j (2m \ ( _ 1)lt 

k=0 ^ ' k=0 ^ ' 

If rt = 2m + 1 

Y_ f2 k gM - 2 k + 1\ ( _ 1)k > £ f2k+lg2m _ 2k /2m + 1\ ( _ 1)]e 

k=0 ^ ' k=0 ^ ' 



Proof. Expand (f + ig) n . □ 

The following corollary is useful. 

Corollary 20.37. If T: P — ► P preserves the degree and sign of the leading coeffi- 
cient then!: 3 — > 3. If J also preserves strict interlacing then T: 3 — > J. 

Proof. Since T is a transformation that only involves real coefficients, we know 
that 

£R(T(f)) = T(W(f)) and3(T(f)) =T(3(f)). 

□ 

Corollary 20.38. These linear transformations map 3 to itself. 

1. f i— ► f + af ' where net 

2. f I— > g(D)f z^/zere g(x) S P 

3. f i-> EXP(f ). 
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4. f i-> H n . 

Proof. The only observation required is that the second one follows from the 
first. □ 

Remark 20.39. Note that the first transformation is equivalent to the statement 
that if g < h then g + ocg' < h + ah'. This is Lemma [1.551 

Corollary 20.40. If f G 3 then 

1. f(x + i) +f(x-i) e 3 

2. (l/v)(f(x + i)-f(x-i))GJ 

Proof. The first one is 2 cos( D)f, and the second is 2 sin( D)f . □ 

Lemma 20.41. Suppose that Si,S 2 : P — > P are linear transformations that pre- 
serve < and the sign of the leading coefficient, and satisfy Si(f) <S 2 (f) for all f e P. 
Define T(f ) = Si(f ) + iS 2 (g). Tten, 

• T: P — > 3 

• T: 3 — > 3 

Proof. The first is immediate from the hypotheses of the Lemma. Since T(f + 
ig) = Si(f ) — S2(g) + i(Si(g) + S2(f)) the degrees and the signs of the leading 
coefficients are correct. We have 

Si(f)-S 2 (g)<Si(g) since Si(f) < Si(g) < S 2 (g) 
<S 2 (f) since Si(f) <S 2 (f) <S 2 (g) 



Adding these two interlacings gives the desired interlacing. □ 

Here are some t 
sis of the lemma. £ 
leading coefficient. 



Here are some examples of linear transformations satisfying the hypothe- 
sis of the lemma. Suppose that U : P — ► P preserves < and the sign of the 



Si(f)=U(xf) S 2 (f)=U(f) 
Sa(f)=U(f) S 2 (f)=U(f)' 
Sa(f)=U(f) S 2 (f)=U(f) 

Remark 20.42. Here is a different proof of part of the Hermite-Biehler theorem. 
We prove that if f < g and the leading coefficients of f and g are positive, then 
h(x) = f(x) + tg(x) has all roots in — H. First of all, notice that h(z) can not 
have any real roots, since if h(<x) =0 and a is real then f(a) = g(<x) = 
which contradicts f < g. Next, it follows from from Example 13.591 that (x + 
x) n — f(R + gal is in 3 since f>R < ga, and (x + i) n e Ui(C). Since any pair of 
strictly interlacing polynomials can be reached by a path of strictly interlacing 
polynomials, it follows that any pair of strictly interlacing polynomials with 
positive leading coefficients must have all roots in the lower half plane, since 
the roots can never be real. 
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Real axis 



Figure 20.1: Trajectories for f + ti g 



The real parts of the roots of a polynomial in 3 are constrained by the loca- 
tion of the roots of the real part of the polynomial. See Question l95l 

Lemma 20.43. Suppose that f = f«n(x) + vfjfx) e 3, and assume that all the roots 
off<n lie in an interval I. Then, the real parts of the roots ofi also lie in I. 

Proof. We may assume is monic, and write ftn(x) = 0( x — r ^)' — 
Y_ 0Lkfm/( x — r k) where the Qk are non-negative. We show that if £K(cr) is 
greater than all r k then f (cr) ^ 0. The case where IKcr is less than all the roots 
is similar. Dividing by f <n 



= 1+1^ 



ak- 



(j-r k 



Since £Hcr > rk, all terms in the sum have positive real part, and so cr is not a 
zero of f . □ 

Suppose that f < g and consider the plot (Figure l20.lt of the roots of F t = 
(1 — t)f + itg for ^ t ^ 1. From the lemma we know that the real part of the 
roots of F t lies in the interval determined by the roots of f . At t = the roots 
are the roots of f, and as t increases the roots move to the roots of g, except for 
one root whose imaginary part goes to — oo. 

Here's a variation of the lemma where we start with polynomials of equal 
degree. 

Lemma 20.44. Suppose that g A f where f and g have the same degree. Ifi + ig — 
F7(x — Tk — iSk), f has roots a\ ^ • • ^ a n , g has roots b\ ^ • • • ^ b n then 

r ai+bi a n + b n -| 

r k 6 



2 2 J 

Proof. Suppose that a + i|3 is a root of f + ig. Then 

n(a+i(3-a k ) +i[](a+ip-W =0 

a + i(3 - b k 



n 



a + i(3 — ak 



i r (K-b k ) 2 + |3 2 
A A (a-a k ) 2 + |3 2 
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Since g M f we have that b k < a k . Thus, if b k < a k and a < (a k + b k )/2 
then the kth term is less than one, and if a > (a k + b k )/2 then the kth term is 
greater than 1. This implies the result. □ 

20.6 Some simple consequences 

Corollary 20.45. 7ff(x,y) € U 2 and cr has positive imaginary part then f(x, cr) e 3. 

Proof. Since U2 C U2(C) substitution of cr yields a polynomial in Ui(C), and 
this is in 3 since the coefficient of x n is positive. □ 

Corollary 20.46. If f(x,y) G U 2 and 

f(x,y) = f (x)+f!(x)y +f2(x)y 2 + ••• 
^/^e?^ 

f - f 2 + f 4 <fi-f 3 + f 5 

Proof. This is just a restatement of Corollary 120.451 where f«n is the left hand 
side, and f 3 is the right hand side. □ 

Lemma 20.47. Iff = Y_ atjxV G P 2 (2n) f/zen ^ (-l) (lx+i+j)/2 ciijxV 

ij i=j (mod 2) 

is z'n P2 

Proof. Since g(x,y ) —i n f (— tx, — ly ) e 112(C), the expression of the conclusion 
is the real part of g. □ 

Corollary 20.48. If f < g in U2 f/ien f/ze matrix Q ~ f 9 ^ maps a pair 0/ interlacing 
polynomials in U 2 to a pair of interlacing polynomials in P2. 

Proof. The action of the matrix on ( £ J is the same as the multiplication of 
h+tkbyf + tg. □ 

Corollary 20.49. Suppose that f(x,y) € U 2 . Then 

1. ±(f(x,y + i)-f(x,y-t)) G U 2 (C) 

2. f(x,y+t)+f(x,y-i) G r/ 2 (C) 

3. f(x + i,y - 1) + f (x - i,y + 1) - f(x + t,y + 1) - f (x - i,y - 1) G U 2 (C) 

Proof. The first one is sin(D) and the second is cos(D) applied to the y variable. 
If we apply sin to (1) using differentiation with respect to x, then we get (3). 

□ 



Lemma 20.50. If f(x,y) G U 2 then 



f(x,y + it) dt G rHP 2 . 
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Proof. If we write the integral as g + ih where g, h have real coefficients, then 

rl 

f(a,y +it) dt = g(a,y) + ih(a,y) 

o 

Since the left hand side is in 3, we see that g(a,y ) < h(a,y ) for all y. It's clear 
that the homogeneous parts of g and h have all positive coefficients, and so 
they are in U2 and interlace. □ 

Remark 20.51. If f < g in 3 then we can write g in terms of the roots of f, but 
the coefficients might be complex. For instance, suppose 



Di = 



S = 



/ 95 40 78 61 \ 

40 35 39 48 

78 39 70 60 

\ 61 48 60 70 ) 

( 3 6 3 3 \ 

6 12 7 

3 2 6 5 

\ 3 7 5 6 / 



D 2 = 



/ 90 65 73 77 \ 

65 57 65 68 

73 65 83 73 

\ 77 68 73 90 / 



f = |xI + yDj + S + 1D2I 



Since Dj, D2 are positive definite, and S is symmetric, we know that f 6 rHP2. 
If f = f (x) + fi(x)y H then fo<fi. If roots (fo) = (n) then we can write 



f 1 = Y_ Q i 



fo 



ai = -17. - 292.1 
a 3 = -0.775 - 12.3v 



where 



a 2 = -1.38 - 13.2i 
a 4 =3.17-2.45i 



20.7 Interlacing in 3 

We make the usual definition of interlacing in 3 (closure under linear combi- 
nations), and it turns out that we have already seen the definition! We then 
gather some properties of interlacing in 3. We begin with the case of unequal 
degrees. 

Definition 20.52. If f, g e 3, then f < g if and only if f + ccg e 3 for all a e R. 

Suppose that f = f ^ + if 3 and G=gfH + iga-f<gis equivalent to ffR + 
if 3 + oc(grR + UJ3) G 3, which, if we express this in terms of the definition of 3 
is 

fjK + ocg OT < fj + ag 3 . 
It follows from Lemmas ll.51l and ll.52l that we have 

Lemma 20.53. Suppose that f = f ^ + if a and g = g^ + tg^ are in 3. Then f < g 
if and only if 
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1. f<n<9<n 

2. f 3 < g 3 



Such polynomials exist. For instance, 

(8 + 14x + 7x 2 + x 3 ) + i(22 + 25x + 6x 2 ) <(16 + 17x + 4x 2 ) + 1(29 + 15x) 

These polynomials arise from a general construction - see Lemma [20.571 
Here are a few simple properties of < for 3. These are the 3-analogs of 
the usual interlacing properties in P. Note that multiplication preserves strict 
interlacing. Many of these can also be easily proved using the non-vanishing 
definition of 3, but it interesting to see that we can prove them just using 
properties of interlacing polynomials. 

Lemma 20.54. Assume f , g, h e 3. 

1. f<f. 

2. Iff <g and f <hthen f < ag + |3h/or positive a, |3. In particular, ctf+|3g e 3. 

3. If f < g < hthen f - h< g. 

4. Iff < g and 3(cr) > then f + erg e 3. 

5. Iff < g f/zen fh < gh. 

6. Iff G 3 pos thenf[-D)g G 3. 

7. Iff g 3 pos anrf f rev = f$? v + if a ev t/ie« f rev G 3 pos . 

8. Suppose that f factors as Yli x — °k.)- 

fl) — - — G J 

x- ff k 

f 

W f<5" av for any non-negative a^. 

Proof. If f = f>H + vfa then f ' = + vf 3 . Since f«« < fj, the interlacing condi- 
tions hold. The determinant condition is Lemma [l.55l 

In order to see that f < ag + (3 h. we have to check the three conditions, and 
that ag + |3h G 3. The first two follow from the additivity of interlacing, and 
the third one follows from the linearity of the determinant: 



f<n fa 




fm 


h 


+ 




fa 


ag OT + |3run otg 3 + |3h 3 


= a 


9<n 


93 


h«K 


ha 



Since the real parts of f and ag + |3h interlace, as do the imaginary parts, 
and the determinant is negative, we can apply Lemma 11.591 This shows that 
the real and imaginary parts of ag + (3 h. interlace, and hence it is is in 3. 
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If f < g < h. then the argument is nearly the same as the previous one. 
If we write cr = a + bi then f + ag = (f + ag) + i(bg). Since f + ag < bg it 
suffices to consider f + ig. Expanding into real and imaginary parts 

f + ig = (f«H - gar) + i(fa + gm) 
and using the interfacings f^<fa + g^<g3 yields 

9t(f + ig) = fK - g 3 < h + gm = J(f + ig) 

Since f € 3 pos both fg^ and fa have all negative roots, and consequently 
ftft[— x)fjf(— x) has negative leading coefficient, and positive constant term. 
Consequently, Proposition 17. 1 71 implies that we have the interlacing square 

h(-D)g m - fa(-D)g 3 



Y Y 

fgt(-D)ggt f K (-D)g 3 

It follows that 

fm(-D)gm - f a (-D)g 3 < f K (-D)g 3 + f 3 (-D)g m 

Since all the roots of the real and imaginary part are positive, reversal pre- 
serves interlacing. 

Since f < g we know that f + ag G 3 for all real a. Thus hf + ahg = 
h(f + ag) G 3 since 3 is closed under multiplication. 

Since interlacing is preserved by addition it will suffice to show that 
f < f/(x — Ok)- If Ck = i"k + iSk then we compute 

f f 

f + a = • (x — Tk — tsic + a). 

x - r k - ts k x — Tk - *s k 

Since both factors are in 3, so is their product, and thus we have shown inter- 
lacing. □ 

There are two choices for the Hadamard product of polynomials in 3. Sup- 
pose that f = Y_ ^i* 1 an d g = Y. Pi* 1 '- 

1. Same definition as before: f * g = 2T_ OiPix 1 . 

2. Separate the real and complex parts: f * l g = f<n * g« + ifa * gj- 
Surprisingly each one preserves 3. 

Corollary 20.55. Iff G 3, g G 3 V ° S then f* l ge 3 pos . 

Proof. This is Lemma [7791 □ 
Corollary 20.56. Iff, g G 3 pos f/zen f * g e I 1 



1 Unlike P, it's not true that the Hadamard product * maps 3 x 3 pos > 3 
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Proof. Write in terms of real and imaginary parts: 

f * g = (fm * g« - f 3 * gar) + i(f<n * ga + fa * go*) 

Since Hadamard product preserves interlacing in P pos , the interlacing of the 
two sides follows from the interlacings 

f<n * Qm < f« * ga + h * Qm < f a * ga 
The result is only in 3 since f?n * g?n — fa * 9a might have negative signs. □ 



We can construct interlacing polynomials in 3 from polynomials in P£ 
andpP° s . 

Lemma 20.57. 



pos 



1. If fo(x) + fi(x)y + ■•• e and fo /zas aZZ distinct roots then 

ft + lfi+1 < fi+l + lfi+2- 

2. J/^ f i,j (x)y l z' G F^ os and fo ; o /iss aZZ distinct roots then 

fi,j + < T i,j+1 + T-fi+lJ + l 

fi,j + lT i,j + l < fi+l,j +lfi+l,j + l 



Proof. Since f < f 1 by Lemma 19.631 we know from Lemma 19.651 that 
fi+i fl+2 ^ F rom Corollary 19.641 we see that f, <fj+i <L^?. This es- 
tablishes the first part. 

For the second part, since fo,o has all distinct roots, it follows that all in- 
terlacings are strict. Thus, in order to verify the interlacing in 3 we need to 



show that the determinant 
a for x, then Corollary 19. 81 



is positive for all x. If we substitute 



fl,j fi+W 

f ij +1 ft+lj +1 

says that all quadrilateral inequalities are strict if 
f(<x, 0, 0) 7^ 0. Moreover, if they aren't strict, then f(oc,y,z) satisfies the hy- 
pothesis of Lemma 19.801 and so there is a triangular region of zeros. It follows 
that fo,o(x) and fi,o(x) have a common root, which is a contradiction. 

□ 

Example 20.58. Consider an example. Write 

(1 + x + y + z] [2 + x + 2y + z) (4 + x + 3y + 2z) 

as any array with the usual horizontal and vertical interlacings. 
2 

\ 

10 + 5x < 9 

\ I 
16 + 17x + 4 x 2 < 29 + 15 x 13 

8 + 14 x + 7x 2 + x 3 -<- 22 + 25 x + 6x 2 20 + llx ^- 6 
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Each vertical and horizontal arrow determines a polynomial in 3. If we only 
consider the horizontal arrows then we have the following interlacings in 3: 

(10 + 5x)+i(9) 



(16 + 17x + 4x 2 ) + i(29 + 15x) 



(29 + 15x)+i(13) 



(8 + 14x + 7x 2 + x 3 ) + i(22 + 25x + 6x 2 ) (22 + 25x + 6x 2 ) + t(20 + llx) 

t 



(20 + llx) +i(6) 



Lemma 20.59. J/f(x) G 3 then 



f (x + it) dt e 3. 



Proof. If we write f = f m + if j then we know 



F«ft — 



Wx + it) dt e 3 



F 3 = 



f-dx + it) dt e j 



Since ftR + af a e P for all a it follows that FtR + aFj e 3 for all Thus, 
F?R < F 3 in J, and therefore Jg f (x + it) = F<k + iF 3 el □ 



20.8 Particular interlacings 

It is easy to describe all the polynomials that interlace a given polynomial in P. 
This is an unsolved problem in 3. We have a number of examples and simple 
observations. 

Example 20.60. In P, (x — a) 2 <J x — b if and only if a = b. The first surprise is 
that there are many polynomials interlacing (x + i) 2 . We show that a satisfies 
(x + i) 2 < x — ff in 3 if and only if a lies in the disk of radius 1 centered at —l. 

To see this, write <7 = u+iv. The interlacing requirements are met if |u| < 1 
and v > 0. The determinant requirement is that 



> 



-1 2x 

U V 



x 2 (v-2) -2xu 



This means that the discriminant is non-positive, and thus u 2 + (v + l) 2 ^ 
1. The general quadratic appears to also have a simply stated answer - see 
QuestionHH 

Example 20.61. Another difference from interlacing in P is that 
(x + i) 2 < (x + i + a) 2 if and only if < a < 2. 

Note that 

(x + 1 + a) 2 = (x + i) 2 + 2cc(x + a/2 + 1) 
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In order to have interlacing we need that a > and that (x + l) 2 <(x + a./2 + x). 
The latter happens exactly when a| < 2 by the first example 

More generally if x + x) 2 -C (x + a + (3i) 2 , and (3 is not zero, then 

(x + x + a) 2 - (x + x) 2 = 2(oa - i + (3)x H 

If a 7^ 1 then the difference is not in J. 

Example 20.62. If we replace x by Px in the first two examples, and observe 
that interlacing is preserved under x^x + t where r € M, then we see that 

1. (x+ ff) 2 <(x + t) iff |ct- t| < 2 3(ff). 

2. (x+ ff) 2 < (x+ a + a) 2 iff < cc < 23(d). 

As ff approaches the real line, both interfacings converge to the usual interfac- 
ings in P. 

Example 20.63. Here are some of the properties of the polynomials interlacing 
(x+i) n . If (x+i) n <f(x) and we differentiate n-2 times then (x+i) 2 < f (n ~ 2) . 
Now the single root of f ' n ~ 2 ' is the average of the roots of f . Consequently we 
conclude: 

The average of the roots of f lies in the unit disk centered at x. 

Next, we claim that if (x + i) n < (x + n) n_1 where r is real, then < r < -^j. 
To see this, let f = (x + i) n and g = (x + n.) n_1 , and note that 

f«H = x n + • • • f a = ux 11 " 1 + • • • 

9n=x n - 1 + --- f =(u-l)x n - 2 r+--- 

so coefficient of x 2n ~ 2 in f^ga — fagsn is (n — l)r — n. Since the determinant is 
negative, the leading coefficient must be negative, and sot ^ n/(u- 1). 

If (x + ff) n <C (x + T) n then differentiating n — 2 times shows that (x + 
ff) 2 < (x + t) 2 , and so < T - ff < 2. 

Example 20.64. Given n, we can determine an r so that if s € (r, n/(n— 1 ) ) then 
(x + i) n <(x + s t) n_1 . To do this, we compute the discriminant of the deter- 
minant. This is a polynomial in s. The intervals between consecutive distinct 
roots of the discriminant have the property that the determinant has no mul- 
tiple roots for s in the interval. If all roots of the determinant are complex, and 
the leading coefficient is negative then the determinant is everywhere nega- 
tive. 

Once we have these intervals then it is easy to verify that the real and 
complex parts interlace for values of s in these intervals. Table I20rf1 lists a few 
of these intervals. 
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n 


interval 


2 


(0, 


2) 


3 


(1/9, 


3/2) 


4 


(•23, 


4/3) 


5 


(•34, 


5/4) 


6 


(.42, 


6/5) 



Table 20.1: Intervals where (x + i) n < (x + s i) 



20.9 The geometry of interlacing in 3 

There is a geometric condition for the interlacing of polynomials in 3. Al- 
though it is in general hard to verify it does help to understand why polyno- 
mials in 3 fail to interlace. 

Suppose that f,g 6 3 satisfy deg(f) = deg(g) + 1 and have no common 
factors. If they don't interlace then there is an a € K and cr in the closed upper 
half plane such that (f + ag ) ( cr) =0. Now if a = all the roots are in the lower 
half plane, and the roots are continuous functions of a, so we see that 

If f and g don't interlace then there are a, t £ M. such that 

(f + ag) (t) = 0. If they do interlace then (f + ag) (t) is never zero. 

Now if f(t) + ag(t) = then |(t) € M, so arg(f/g) is a multiple of n. This 
gives us a way to check interlacing. Let 

Thus, if deg(f ) = n then f and g interlace iff the org sum 

n n—1 

^arg(x-rt)-^arg(x-Si) (20.9.1) 
i i 

is not a multiple of n for any real x. Sometimes we can use geometric argu- 
ments to show that this is indeed the case. 

Example 20.65. Suppose that the roots of f and g all have the same imaginary 
part, and their real parts alternate. (Figure 120.21 1 Now 

n n—1 n—1 

1 1 1 

Since the roots alternate all of these differences have the same sign. The arg 
sum is the sum of the shaded angles in the figure. It's clear that their sum is 
between n and for all x, and so they interlace. 



argl 
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x 




3(z) =0 



fi 9i f 2 92 f 3 

Figure 20.2: Interlacing example 



Example 20.66. Now we look at the behavior at infinity. If deg(f) — deg(g) + l 
and x goes to positive infinity then the angle between any of the 2n — 1 points 
and x is close to 0, so the arg sum converges to 0. As x goes to — oo the arg 
sum goes to n. See Figure l20.3l for an example of a plot of the arg sum of two 
interlacing polynomials. 




7T 







Figure 20.3: An argument plot for interlacing polynomials 



Example 20.67. The polynomials whose roots are given in Figure 120.41 don't 
interlace. This can be seen from the plot of the arg sum in Figure l2031 since the 
curve crosses the line y — 0. The failure to interlace can also be checked using 
Lemma 120531 




3{z) = 



fl f 2 f3 91 92 

-2-t — 1 — i —i l-i 2-t 
Figure 20.4: Non-interlacing example 



Example 20.68. In this example we give four roots, for which no choice of a 
fifth root leads to interlacing. Consider Figure l20.6l The arg sum is 

ZfjOgi + Zf 2 0g 2 + Z10f 2 = tt/2 + n/2 + Z10f 3 > n 
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Figure 20.5: An argument plot for non-interlacing polynomials 

so no choice of f 3 can give interlacing polynomials. 






h 92 

/ <= \ 

fi 9i 
Figure 20.6: Roots of non-interlacing polynomials 



Example 20.69. We revisit the problem of finding all |3 for which (x + i) n and 
(x + (3i) n_1 interlace. The arg sum is 

A(x, (3) = narctan(l/x) — (n — 1) arctan(|3/x) 

The unique solution to A(x, |3) = is |3 = n/(n — 1) as we saw before. The 
solution to A(x, (3) = 7t determines the lower bound. We find the smallest 
possible |3 by solving 

A(x / p)=0 

ox 

Using the addition formula for arc tangents 

x + y 



arctanx + arctany = arctan 



1-xy 



we see that these are polynomial equations. The first few solutions are in 
TableEOU 

20.10 Orthogonal type recurrences 

We can define orthogonal type recurrences whose members all interlace in 
3; the only new condition is that the imaginary part of the constant term is 
positive. 
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Lemma 20.70. Choose constants > 0, 3(bk) > 0, Ck > 0. If*P-\ = 0, po = 1 
and 

Prt+l = (a n x + b n ) p n - c n p n _i 
then p n e 3 and we have interlacings pi > p2 > P3 > • • • . 

Proof. We prove that p n +i < Pn by induction on n. We first note that pi e 3 
since its only root has negative imaginary part by the hypotheses on do and 
bo- Thus, pi < po since all linear combinations are in 3. 

Next, assume that p n < p n -i- All the leading coefficients of the polynomi- 
als in the interlacing 

(d n X + b n ) Pn < Pn < CnPn-1 

are positive, and so it follows from Lemma [20.541 that 

(d n X + brO Pn - CnPn-1 < Pn 

which finishes the proof. 

□ 

Example 20.71. We get two interrelated recurrences if we separate into real 
and imaginary parts. Consider the simple example 

Pn+l = (" + l)Pn-Pn-l 

If we write p n = r n + is n then 

1"n+l = xr n — T-n.— 1 — S n 
Sn+1 ="S n - S n _i + T n 

The recurrences for just the r's and s's are the same: 

r n +i = 2xr n - (3 + x 2 ) r n _i + 2xr n _ 2 - r n -3 

S n+ i = 2x S n - (3 + X 2 ) S n _i + 2x S n _2 - S n -3 

It is hard to see how one would ever show that the r's and s's interlace if 
we only had these recurrences. 

Example 20.72. Consider the Chebyshev recurrence p n +i = 2xp n — p n -i, 
except we start with pi = x + 1. An easy induction shows that 

Pn = T n + lU rt _ 1 

We can see directly that this is in 3 since the Chebyshev polynomials T n and 
U n have positive leading coefficients and satisfy T n < li n _i. 
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Rising Factorial 


Pn = 


(x)n + l (x) n -l 

[n-1 -i + x\p n -\ + i[n- 


3 + x]p n _ 2 


Hermite 


TV,-, = 

Pn = 


H n + tH n _i 
2xp n _i -p n _j 




Bell 


Pn = 
Pn = 


B n + iB n _i 

X(p n -1 +Pn-l) 




Euler 


Pn = 
Pn = 


B n + iB n _i 

(nx - i)p n -i + (n - 2) ixp n _2 + (x- x 2 )(p n _j + ip n _ 2 ) 


Laguerre 


Pn = 

npn = 


Ln(-x) +lL n _i(-x) 

[x + n - 1 (n - 2)]p n -i + 1 


n-2) Pn _ 2 + (l + i) P ;_ 1 


Legendre 


Pn = 
Pn = 


Pn + lPn-1 

(x + i)p n _i - (n - 2)p n _ 2 H 


" 2x 2 p;_j - 2xp n _ 2 


Gegenbauer 

n(n - 


Pn = 
-l)Pn = 


2(n + 2)(n-l)xp n _i - (n 


+ l)(n + 2)p n _ 2 +2(l-x 2 ) P ;_ 1 



Table 20.2: Recurrences for f n + if n _i 



Here's a simple generalization of that fact. 



Lemma 20.73. Suppose orthogonal polynomial sequences {pi} and {qi} both satisfy 
the recurrence 

Tn+iM = (a n x + b n )r n (x) -c n r n _i(x) 
where all a n and c n are positive. If 

Po = 1, Pi 

then 

p n <q n and 



x, q = 0, qi = 1 



Pn+l 

qn+i 



>0. 



Proof. Let Wi satisfy the same recurrence with Wo = 1 and Wi = x + I. By 
induction w n = p n + iq n , and w n satisfies the conditions of Lemma 120.701 
Thus, w n < w n+ j which gives the conclusions of the lemma. □ 

If we have a sequence of interlacing polynomials f k < f k_i and we let = 
fn + if n-i then all p n are in T. Table l20.10l fists recurrence relations for the p n 
for several polynomial sequences. 
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20.11 The transformation x h- > exp(— ix) 

We have seen (Lemma 15.101 1 that sin(x) — a € P when — 1 < a < 1. The 
next lemma extends this to complex exponentials. We then use this result to 
determine mapping properties of the Chebyshev polynomials. 

Lemma 20.74. If \u\ < 1 then e~ lx + a e 3. 
Proof. If we write x = a + bi then 

e~ lx + a = e- al e b + cr 
Now |e~ al = 1, and e b > 1 since b > 0, so the sum is non-zero since |oj ^ 1. 

□ 

Proposition 20.75. If f e P A then f (e~ lx ) e 3. 

Proof. This is an immediate consequence of Lemmas 120.341 and 120.741 This is 
essentially due to Polya: see |89j page 50]. □ 

Corollary 20.76. If T n and U n are the Chebyshev polynomials then x n i— > T n (x) 
andx n ^U n {x) map p(-W — >P { - U] . 

Proof. We only consider T n ; the case of U n is similar. T n (x) is defined by 
TrLfcosx) = cos(nx), so the diagram below commutes at the element level. 
Proposition l20.75l implies that the top arrow maps as claimed, and hence x n i— > 
cosnx maps as in the diagram. Since g(cosx) € P if and only if g(x) £ P' -1 ' 1 ', 
the proof is complete. 



p(-i,i) — 

X^Tnlx) 

V 

p(-l,l) ^- 



X"H»COSUX 



fi-^JRe(f) 



cos(x)i-+x 

□ 

1 - e~ l y 

Example 20.77. is another function in 3. To see that this is so, note 

V 

that 

1 — e~ yl 1 — cosu sinu 

= + 1 

V V V 

2 sin 2 (y/2) sinu 

y v 

The infinite product expansion shows that sin 2 (u/2)/u and sin(y)/y are in P. 
Moreover, if we approximate them by their partial products then they inter- 
lace ( <). For instance, for zeros up to ±2tc the roots are 
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sin 2 (y/2)/y : -2n, -2n, 0, 0, 2tt, 2n 
sin(y)/y : — 2n, — n, 0, n, In 

If we know that a polynomial is in P A then we can use Proposition 120.751 
to get information about trigonometric and exponential polynomials. We use 
a result due to Enestrom-Kakeya f20l to create a polynomial in P A . 

Theorem 20.78. If f (x) = ao + aix + • ■ ■ + a n x n and 

a n > a n _i > • ■ • > a > (20.11.1) 

then f e P A . 

Corollary 20.79. If the real numbers {a^} satisfy (|20.11.1|) then 

1. 2^k=o ak sm (kx) has all real roots. 

2. 2^k=o a k cos(kx) has all real roots. 

3. The roots of 2Zk=o ak sin(lcx) and YJk=o Qk cos (kx) interlace. 

Proof. Since f (x) = ao + Qix + ■ • • + a n x n is in P A , we can apply Proposi- 
tion|20Z5|to conclude that £ a k e- lkx e 5. This implies (1), (2), and (3). □ 

If we take limits we get 11391 III 205] 

Corollary 20.80. If$[x) is a positive increasing function on (0, 1) with J J 4>(x) dx 
finite then 

f 1 

4>(x)e~ ltx dx 6 5. 

. o 

Proof. Since the integral of cf) is finite we have 

4>(x)e- ltx dx= lim Y -4)(k/n)e- ltk/n 
Jo n ^°°^T 1 n 

Since <}) is increasing the coefficients (l/n)4>(k/n) are positive and increasing. 
It follows from Proposition l20.75l that the sum is in 3, and hence the limit is as 
well. □ 



20.12 Newton inequalities 

We have seen mutually interlacing polynomials evaluated at points in the up- 
per half plane (Figure \3~M . We now give an example of the ratio of the coef- 
ficients of a polynomial in P^ evaluated at a point in Qj. If f = Y. ^ii x )v x 
then Figure 120 . 71 shows the ratios of a fourth degree polynomial evaluated at 
a point cr € Qi. 

There are two striking facts in this picture: 



CHAPTER 20. UPPER HALF PLANE POLYNOMIALS 



617 



f 3 (<T) 
f 4 (<7) 



fa(o-) 



f 2 (a) 



fo(cT) 

fi(cr) 



Figure 20.7: Coefficient ratios of f e P£ and cr e Qi 



1. The ratios are all in the first quadrant. 

2. The real and imaginary parts are increasing. 



The first part is easy; the second part is contained in the following lemma. 
We can differentiate, so we only need to establish it for f o and f \ . Since fo < f \, 
we can write 

fo _ ( y- 

where the at and the Ti are positive. If ff e Qi then cr + n € Qi, and so 

^(cr + r^) 1 eQiv, which establishes the first observation above. 

The lemma below is another generalization of Newton's inequality. If we 
let f = Y. a i xX £ P pos then f (x + y ) e P^. Choosing cr = in the Proposition 
gives that 



o< Di- 



sc m- 



CLi+l Cli+2 

since fW (0)/i! = a\. The imaginary part is vacuous. 

Lemma 20.81. Suppose that Y. fiM U l G P^fn). For all ue Qi and sC i < 
n-2 



sC 91 
3 



ft+iCtr) 

fi(£j 
ft+iCo") 



s; 91 



< 3 



fi+l(£) 
fi +2 (cr) 

fj+ito-) 



Proof. Since a G Qi we know that f(c,u) G 3. The inequality now follows 
from Lemma 120821 □ 

If all the roots of a polynomial lie in a quadrant then there are Newton 
inequalities for the real parts, imaginary parts, and magnitudes of the coeffi- 
cients. 
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Lemma 20.82. Suppose that atx 1 has all roots in a quadrant. The table below 
summarizes the properties of the sequence . . . where "mq,[" means it is 

a decreasing sequence of negative terms, and so on. 





Qi 


Qn 


Qui 


Qiv 


Real parts 


neg4 


pos,t 


pos,t 


neg4 


Imaginary parts 


n.eg,j 


n-eg,| 


pos,T 


pos4 


Magnitude 


T 


T 


T 


T 



Proof. As usual, we differentiate, reverse and differentiate; the result is a 
quadratic with all roots in the original quadrant. We will work with the third 
quadrant since we can use properties of 3. The upper two quadrants follow 
using conjugation, and the fourth quadrant is similar. 
Thus, we may assume that our polynomial is 

(x + a)(x + b) + ic(x + d) 0<a<d<b, 0<c 

We simply need to compute the real parts, imaginary parts and magnitudes, 
and show that the inequalities are satisfied. If we write the polynomial as 
qq + aix + Q2X 2 then 



Q 2 

ao 
ai 



3 — = 
ai 

ao 
ai 



ab + led 
1 

ba 2 + b 2 a + c 2 d 
a 2 +2ba + b 2 + c 2 
c(-ab + db + ad) 
a 2 +2ba + b 2 + c 2 



3 



ai 
ai 



vc 



2a2 



a 2 b z + c 2 d 
a + b) 2 



a 2 

2 



All six terms are clearly positive, and it is easy to check that all three inequal- 
ities are satisfied. 

□ 



The bound for absolute values is stronger that the bound we get from con- 
sidering sectors - see !677l 



Chapter 



Stable polynomials 



A one variable polynomial is stable (sometimes called Hurwitz stable) if all its 
roots lie in the closed left half plane. This is a well studied class of polynomials 
with many applications - e.g. l34l . There are two classes of stable polynomials 
in one variable: 

!K\ — real coefficients and roots in the closed left half plane 
Jfl(C) = complex coefficients and roots in the closed left half plane 



Since polynomials whose roots are in the closed left half plane are non-zero 
in the right half plane, we can apply the results of Chapter[l9]to get properties 
of stable polynomials in d variables. 

Definition 21.1. 

{All polynomials f(xi, . . .,x<i) with complex coefficients 
such that f (oi, . . . , era) ^ for all a\, . . ., o& in the right ^ 
half plane. If we don't need to specify d we simply write 
5C(C). 

call such polynomials stable polynomials. In one variable they are often called 
Hurwitz stable. 

After establishing the basic properties of stable polynomials in d variables, 
we focus our attention on stable polynomials with real coefficients in one vari- 
able. We will see that whenever a polynomial is shown to be strictly positive, 
it's likely that it's stable. There are three kinds of interlacing for stable poly- 
nomials, and positive interlacing is especially useful for showing that a polyno- 
mial is stable. 

For a short summary of the properties of stable polynomials, see | 
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21.1 Stable polynomials in d variables 

We apply the general results of Chapter[l9]to derive results about polynomials 
non-vanishing in the open right half plane (RHP). The reversal constant is 1, 

and RHP/RHP = C \ (-00,0). If f,g G M d (C) then we write f ^- g if 
f + zg G M d+i (C). From Chapter[l9]we have 

Lemma 21.2. Suppose that f G JC d (C). 

2. Ifaie RHP and r t > then ^{r\Xi + Oi, . . .,r d x d + cr d ) G ^d(C). 

2. f(xi + y,x 2 , . ..,x d ) G JC d +i(C). 

3. //(j elfta f(iff,x 2 ,...,x d ) G 5£ d -i(C) U{0}. 

4. 2/f g f/jgjj g <±L f. 

5. 2/f g ^ h f/zer; f + h ^- g. 

6. JC d (C) zs c/osed imder differentiation. That is, if g G M d (C) f/zej-z -^-f G 
^d(C)U{Oj. 

7. f <-tL 3f/3xi 

8. 2/£x i f i (y) G 5t d +i(C) thenhiv) G 5f d (C) U {0} and f t ^- f i+1 . 

9. f(x) x g(x) i— > f(3x)g(x) determines a map 

5C d (C) x M d (C) — v JC d (C). 

20. 2/ £ x l f t (y ) G M d ( C ) /zas degree n then £ x n ~ l f t (y ) G M d (C ) . 

22. 2/xi has degree e t t/zen • • • x| d f (1/xi, . . . , l/x d ) G JC d (C). 

22 . 77ze Hadamard product £ aix 1 x £ fi(y)x x £ aifn-ityjx 1 - determines a 
mapP pos x J{ d (C) — v5f d (C). 

23. If S is skew-symmetric and all are positive definite then 
|S + £ 1 d x i D i | G H d . 

24. The following are equivalent 

a) f (x) G J£ d 

b) f(a + tb) G 'Kxfor^h > 0. 

Proof. Part (1) shows a) implies b). Conversely, consider 0i,...,cr d in RHP. 
Since 1 is in the closure of RHP we can find an a and positive a, b such 
(ax, . . . , cr d ) = a + ocb. Thus, f (a\, . . ., c d ) = f (a + ba) ^ 0. □ 
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21.2 Real stable polynomials 

We now consider properties that depend on real coefficients. It is immediate 
that P pos c JCi; this is still true for more variables. P^° s was defined in terms 
of U d and the positivity of the coefficients. We can also express this in terms 
of non-vanishing. 

Lemma 21.3. f (x) e P^ os if and only if f ( . . . , ^ ) ^ Ofor all ot in the upper half 
plane. 

Proof. We know that f (x) e P^ 0b if and only if its homogenization is in Ua+i- 
Now the homogenization equals x n f(— , • • • , — ) where n is the degree of f, 
so the lemma is now clear. □ 

Lemma 21.4. P^' s = % d n P d 

Proof. We first show that P^ os C K d n U d . Suppose that f(x) e P p ° s , and 
Ti, . . . , T d are in the right half plane. Now ixi, . . . , ix d lie in the upper half 
plane, and so by Lemma [21 .31 

0^f(^ ^)=f(fi Td) 

Thus, f (x) € J{ d . Next, if f € J{ d n P d then f e P d , and f has all non-negative 
coefficients, so f S P^° s by Lemma Hi .581 □ 

Corollary 21.5. 77ze unsigned reversal of a polynomial in P^ os is tn M d . 

Proof. The only point is that Y p ° s c% □ 

Lemma 21.6. Iff G M d f/ien a/Z coefficients have the same sign. 

Proof. By Lemma [11.57| and following the proof of Lemma [l 1 .581 we can write 
f as a limit of f e e 3i d where all coefficients of f e are non-zero. Since the 
coefficients of a polynomial in "K\ with all non-zero coefficients have the same 
sign, an easy induction shows that all the coefficients of f e have the same sign. 
Thus, the non-zero coefficients have the same sign. □ 

Lemma 21.7. Iff (x) e J{ d and the homogenization F off is in •K d+l then f e P p d ° s . 
Proof. If F is in M d+ i then for t , . . . , x d e RHP we know that 

f(^ ^0 

If ci, . . . , ffd S UHP then we can choose To G RHP so that rotates all of 
the at to the right half plane. Thus cri = t^/to where X\ e RHP, and so f is 
non-vanishing on the upper half plane. Thus f € P d , and by Lemma [21 .61 all 
coefficients of f have the same sign. Consequently, f is in P^° s . □ 
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Remark 21.8. Here is a particular example of a polynomial in "K2 whose ho- 
mogenization is not in JC3. Consider f(x,y) = l + (x+y) + (x+y) 2 withhomog- 
enizationF(x,y,z). F has a root in the right half plane: F(l,— .23— 1.86i,2i) = 0. 

Multiplying each variable by a new variable is similar to homogenizing. 
Lemma 21.9. f(yx) G JQ+i if and only if i[x) G Pj. 

Proof. If f(yx) G Md+i then substituting 1 shows f(x) e If ffi, . . ., CT d € 
UHP then choose t € RHP so that tcti, . . .,Tcr d G RHP. Since t" 1 g RHP 

f(cTi,...,CT d ) =f(T _1 (Tff 1/ ...,TCT d )) ^0 

Thus f G Ud, so f G Pj. Conversely, assume that f G Pt. We know that 
the homogenization of f is in Pj +1 . Writing f = Y. a i x ' aR d taking the reverse 
yields 

^aix'y 11 " 111 ^ajx'y 111 = f (y • x) G 5{ d+1 

so the proof is complete. □ 
Lemma 21.10. //f(x,y 2 ) G M d tfien f(x,y) G PP 0S d. 

Proof. See the proof of the corresponding result for Uh , Lemma l20.4l □ 

Lemma 21.11. Suppose that f (x) = ^J 1 aix 1 G JCi. If some internal coefficient is 
zero then f(x] = x T g(x 2 ) rofaere g G pP° s . 

Proof. We can write 

s t 

f(x) =x r x Htx + Ui) x ]^[((x + b0 2 + Ci) 
1 1 

where at is positive, bt, Ci ^ and bid ^ 0. If the second factor is non-empty 
then there are no internal zeros. If any bt is non-zero there are no internal 
zeros, so f (x) = x r f7(x 2 + cO from which the conclusion follows. □ 

Lemma 21.12 (Hermite-Biehler). Suppose that f (x) is a polynomial, and write 
f(x) = f E (x) + fo(x) where f E (x) (resp. fo( x )J consists of all terms of even (resp. 
odd) degree. Then 

f G !K(C] if and on/y z/f E 3- f Q . 

Proof. Since f (x) = f E (x) + f (x) G J£ we know f (— tx) = f e (— «) + f o (— tx] G 
U(C). Now f E (— ix) has all real coefficients, and fo(— ix) has all imaginary 
coefficients, so by Hermite-Biehler for U(C) we know that 

f E (-ix) + (yA)f (-ix)eu(C) 

Returning to IK(C) yields f E (x) +yf (x) G 0i[C). □ 
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21.3 Polynomials (not) in < K d 

We give some examples of polynomials that are (not) in "K & . 

Example 21.13. Since the map z z 2 maps RHP — > C \ (— oo, 0) it follows 
that the equation x 2 + y 2 — has solutions in the right half plane, so x 2 + y 2 



Example 21.14. We claim xy + a 2 e "K 2 if a e R. As above, the image of RHP 
under xy is C \ (— oo, 0), so adding a positive amount will not make xy equal 
to zero. 

Example 21.15. If cr is complex (and not negative real) then x 2 — cr ^ "K\. 
Indeed, one of cr 1/2 , — cr 1/2 lies in the right half plane. Consequently, xy — cr ^ 



These last two examples imply that 

Lemma 21.16. Iff is a polynomial with positive coefficients then f (xy ) e "Kz if and 
only iff eP pos . 

The stable polynomials of degree 1 in 3{ 2 are clearly ax + by + c where a, b 
are positive and c is non-negative. If a, b, c are positive then we know that 



Lemma 21.17. If a, b, c, d are positive then axy + bx + cy + d e "Kz- 

Proof. If a + bx + cy + dxy = then y = — . By the following lemma the 
Mobius transformation z frgf maps the right hand plane to itself. Thus, if 
x e RHP then y g RHP, and so a + bx + cy + dxy e J£ 2 - 

This can also be derived from the corresponding criterion for U2(C). □ 

Remark 21.18. It is not the case that a multiaffine polynomial with all positive 
coefficients is in 5f 3 . If 



then f (1/2 + 1, 1/2 + i, 1/2 + 1) = 0, so f g Jf 3 . 

Lemma 21.19. If the Mobius transformation M satisfies Mz = fx^f t/ien the fol- 
lowing are equivalent: 

1. M maps f/ie right half plane to itself. 

2. At least one of the following holds 

a) a, b, c, d have the same sign (or some are zero). 

b) b, d have the same sign and ad = be. In this case the image is the single 
point b/d. 



3^2 • 




f (x,y,z) = (135/4) + x + y+ z + xy + xz + yz + 24xyz 
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Proof. If r + is € RHP then we need to show that M(r + rs) e RHP. Equiva- 
lently, we show that if r > then £HM(r + ts) > 0. Now 

bds 2 + (a + br)(c + dr) (bc-ad)s 
M(r + rs) = : — — — = hi- 



(c + dr) 2 + d 2 s 2 (c + dr) 2 + d 2 s 2 

and we can ignore the positive denominator, so we need to show that 

bds 2 + (a + br)(c + dr) >0 (21.3.1) 

but this is clear if a, b, c, d, r are positive. If ad = be then Mz = b/d which is 
in the right half plane since b/d > 0. 

Conversely, assume | |21.3. lb holds. Assume that abed 7^ 0; the remaining 
cases are easily taken care of. Taking s large shows that bd is positive, so b 
and d have the same sign. Taking s = and r close to zero shows that ac is 
positive, so a and c have the same sign. Take s = 0. If ab < then the factor 
(a + br) has a positive root, and so (a + br)(c + dr) will be negative for some 
r close to that root, unless the root is also a root of the other factor. In this case 
a + bz is a multiple of c + dz, and so ad = be. □ 

There are many quadratic forms in !H d . The next lemma generalizes the 
fact that x 2 - 1 G P, and x 2 + 1 e 'Ki- 



Lemma 21.20. IfQisadbyd negative subdefinite matrix then 

x l Qx + a € 



"K d ifoc^O 
Pd (fa<0 



Proof. If a < then x t Qx + a e P d by Lemma [10871 Now if f(xi, . . .,x d ) e 
HP d then f(ixi, . . . ,rx d ) e J{ d . Since P d C HP d it follows that 

-(x t Qx-a) = (ix) t Q(ix) +oce 

□ 

The graphs of stable polynomials (Figure 121. H is a sketch of one) are differ- 
ent from the graphs of polynomials in U2. 

The following famous theorem of Heilman-Lieb determines a multiaffine 
stable polynomial. 

Theorem 21.21. Let G = (V, E) be a graph, V = {1, . . . , n}. To each edge e = ij e E 
assign a non-negative real number Ay . Then the polynomial 

M G (z) Y_ E[ Ai S ZiZ ' 

Mis a matching i j £ M 



is stable. 
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Figure 21.1: The graph of a stable polynomial in two variables 



21.4 Stable polynomials with complex coefficients 

In this section we consider polynomials with complex coefficients in one vari- 
able. Since a 90° rotation moves the right half plane to the upper half plane, 
we can easily transfer properties from 3 to Ji\{C). If f has degree n define 
4>(f ) = i n f (— ix). Note that the factor of i n implies that cf) is not a linear trans- 
formation. 

Lemma 21.22. 7/f has positive leading coefficient then f g JCi(C) zjff c))(f) € 3. 

Proof. We may assume f is monic. All roots of f(— ix) lie in the lower half 
plane, and the leading coefficient is (— i) n . Thus, ct>(f ) is monic, and all roots 
are in the lower half plane. □ 

Interlacing in JCi(C) is different form P. Any definition of interlacing 
should satisfy f < f , and this should mean that all linear combinations of f 
and f ' are stable. However, this is false. If f = x + 1 then f — 2f ' = x — 1 is not 
stable. 

3 is invariant under the transformation x x + a, where aeM, but "K\ (C) 
is preserved by x ^ x + w. In addition, %l(C) is preserved by x x + a if 

a > 0. Consequently, there are two new kinds of interlacing along with < — . 
Assume that f , g e Jfi(C) 



f 



f^g iff 
f ~ g iff 
^g iff 



VaeM f + a;g€ Mi(C) 
VaeR>0 f + ag e5Ci(C) 
VaeRHP f+ageJCi(C) 



We will see later that f ~ f and f ~ f ". Taking limits shows that 
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Lemma 21.23. J/f, g G %l(C) f/zen f g => f ~ g and f ~ g 

Remark 21.24. It is not true that f ~ g =>■ f ~ g. If f = x 2 and g = 1 then 
clearly f ~ g, but x 2 + 1 has roots with positive and negative real parts. 
The next result allows us to transfer interlacing results from 3 to !Ki (C). 

Lemma 21.25. f^-gin ?fi(C) if and only if cb(f ) ~ 4>(g) in X 

Proof. Since cf) is a bisection between Jfj(C) and J, the computation below 
establishes the lemma. 

4)(f + atg) = t n ( f(— ix) + aig(— ix) ) 
= *(f)-ai n - 1 g(- l x) 
= cMf)-a<t>(g) 

□ 

Most properties of interlacing in P carry over to ~ in Jfi(C), except for 
some occasional sign changes. 

Lemma 21.26. Assume f,g, he JCi(C). 
2. J/f ~ g t/ien fh ~ gh. 

2. J/cr G RHP then (x + a)f ~ f. 

3. f~f. 

4. J/f g P pos thenf{iD)g e 3Ci(C). 

5. J/ f ~ g and f ~ h f/zCTZ f ~ ag + |3h /or positive a, (3. Jn particular, 
af + |3g G 5Ci(C). 

6. J/f ~ g ~ h in ^ (C) f/zen f + h ~ g. 

7. x-ff€Ki(C) zjffcrG RHP 

8. x^M 

Proof. The first is immediate. The second is a consequence of 

(x + cr)f + oaf = (x + cr + oa)f 

and x + cr + at G !K\{C). The chain rule shows that (j)(f) = 4>(f)'. Since 
f G 5£i(C) implies that 4>(f) G J, we know that 4>(f) < 4>(f)' = *(f). 

We only need to check the next one for f = x + a where a > 0. We see that 
f(t D)g = tg' + ag which is in %l(C) by the previous part. 
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The next part follows from the corresponding result (Lemma 120.541 for 3. 
We have interfacings 

i n f (ix) < i n_1 g(ix) < x n - 2 h(ix) 

in 3 where f has degree n. In 3 we have 

i n (f + h)(ix) = t n f(ix) -i n - 2 K{ix)<\ n - 1 g{ix) 

and the result follows. The last two are trivial. □ 

We have orthogonal type recurrences in Jfj(C), except the last sign is re- 
versed. 

Corollary 21.27. lfa x , d > 0, > then define 

P-i=0 
Po = 1 

Pn = (Un* + Pn) Pn-1 + C n Pn-2 

AZZ p n are stable, and 

H H 
■••P3 < P2 < Pi 

Proof. Using induction, these follow from the last part of Lemma 121.261 and 
the interlacings 



(X + (3 n )Pn-l Pn-1 <"^- Pn-2 



□ 



Lemma 21.28. Suppose that f(x) = Yl( x ~ °k) is in 5fi(C). if flZZ ak are positive 
and 

9(x)=)"^ (21.4.1) 

k x - ak 

t/tCTJ g G JCi(C)«ndf g. 

Proof. By additivity of interlacing for JCi(C) it suffices to show that f 

f/(x — (Jk), or (x — 0k)h <-^- h where h = f/(x — Ok)- This interlacing holds 

H 1 ' n 

since x— Ck < — 1. □ 

Here are some lemmas that use the geometry of C and are useful in proving 
that polynomials are stable. The first lemma gives useful information about 
the mappings determined by quotients. 

Lemma 21.29. Assume that r, s > and cr G Q\. 

1 1 

6 Qiv t -r, r € lower half plane 



t + cr ' (r + cr)(s + cr) 

— — G Qi ^ r G right half plane 

r+o- (r + ff)(s + ff) 6 JV 
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Proof. Suppose cr = a + bx where a, b > 0. The left two follow from 

1 a + r — lb cr a 2 + b 2 + ar + ibr 

r + cr |r + cH 2 r + cr |r + a| 2 



Since 1/ (r + cr) and 1/ ( s + cr) are in the fourth quadrant, their product is in the 
lower half plane. Since cr/ (r + cr) is in the first quadrant, and 1/ ( s + cr) is in the 
fourth quadrant, their product is in the right half plane. 

□ 

In P we know that f and f " don't interlace, but f — od " € P for all positive 
a. There is a similar result for "K\ (C) . 

Lemma 21.30. Suppose that f = n( x ~ °i) isin r K\(<C). Ifallou^ are non-negative 
then 

' a E«. (,-»)(«-.» < 21A2 > 

Proof. If t is a root of the sum of the right and left hand sides of 1121.4. 2b then, 
after dividing by f (t), we know that 

i + T<x ir - = o 

(t- ck)(t- o-j) 

By Lemma E29] the sum lies in the lower half plane. Thus t does not satisfy 
^21.4.2) . so t must lie in the open left half plane. Similarly for the lower right 
quadrant. □ 

If we restrict all the coefficients to be equal, the lemma gives an alternate 
proof that that f ~ f". 

Lemma 21.31. I/f e Jin[C), JR(a) ^ 0, (3 > tten 

af + pf e 5fi(C). 
af +(3f" e 3Ci(C) 

Proof. Assume that f = f7(x — di) where $Kcri < 0. Factoring out f (x), let t be 
a solution of 

oc+ (3 Y" — -— =0 
* — x — Ot 

If SH(t) ~^ then SHI/ (t — crO > 0, so the sum has positive real part, and can't 
equal —a/ 6. The second follows from Lemma |21.30l 

□ 

We know the following corollary holds more genearally in J£d(C), but in 
!Ki (C) we can prove it by induction. 
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Corollary 21.32. iff, g e 3£i(C) then f(D)g € 5fi(C). 

Proo/. Factoring f, it suffices to show that g + ccg' e 3(i (C) for 91(a) > 0. This 
follows from the lemma. □ 

Linear transformations on P induce linear transformations on JCi(C). 

Lemma 21.33. Suppose T : P — > P is a linear transformation that preserves degree, 
the sign of the leading coefficient, and strict interlacing. The linear transformation 



i k T(x k )(-ix) 



maps Jfi(C) to itself. 



Proof. Since T induces a linear transformation in 3 (p. 16001 1, the conclusion 
follows from the diagram 

£a k x k >£a k (-i) k T(x k )(tx) 



^f[a k (-i) k l(x k ) 



□ 



Corollary 21.34. The transformations x k i— ► i k Hi c (— ix) and x k i— * t k L k (ix) map 
Jfi(C) to Jfi(C), and Jfi to Jfi. 

Proof. The maps x n i— > H n (x) and x n i— > L n (— x) satisfy the hypotheses of 
the lemma. It follows from the above that they map < K\('C) to itself. Since 
i k H k (— ix ) and i k L k (ix) are real polynomials, the maps also send Jfi to itself. 

□ 

If all roots of two polynomials lie in given quadrants then we know the lo- 
cation of the roots of their Hadamard product. In the table below the notation 
— "K\ (C) means all roots in the right half plane, and —3 means all roots in the 
upper half plane. Recall that Y. a i %x * Y. ^i* 1 — Y. cubix 1 . 



Lemma 21.35. (The location of f * g) 



Quadrant 





1 


2 


3 


4 


1 


3 


-5Ci(C) 


-3 


J£i(C) 


2 


-3Ci(C) 


-3 






Quadrant 3 


-2 


5fi(C) 




-JCi(C) 


4 


5fx(C) 


3 


-JCi(C) 


-3 
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Proof. If f and g are in the third quadrant then they are in 3 pos , and so their 
Hadamard product is in 3. The Lemma now follows from the identity 

f (ax) * g(|3x) = [f * g] (afix) 

which holds for all complex a, |3. □ 

Note that this shows that the Hadamard product of stable polynomials 
with complex coefficients is not necessarily stable. For example 

(x + 1 + 1) * (x + 1 + 2i) = x - 1 + 3i. 
21.5 The analytic closure 

We define the analytic closure of !K\ and 3fi(C) as usual: 

5£d = the uniform closure of 
3^d(C) = the uniform closure of J{d(C) 
We can determine the exponentials that are in %\ (C), 
Lemma 21.36. 

1. e ax , e"" 2 are in "K\for all positive a. 

2. e aix is in JCi(C) for all a e K. 

3. sinh(ax)/x and cosh (ax) are in "K\for a e K. 

4. e-*,e- xl arenotin Jti(C). 
Proof. Consider the limits 

e ax = l im (l + ^) n 

e- 2 =]im(l+ a(x + 1/n)2 ) n 

n^oo v TV 

e aix = lim(l + ^) n 

n^oo TL 

In the first two cases the part being exponentiated is in VL\, and in Ji"i(C) in 
the third, so the limits are as claimed. The infinite product formulas 

x 2 \ , , , -Ft r Ax 2 



sinh(x)/x = h + j cosh(x) = H 1 + 27t2 

n=l V 7 n=l V 



show that sinh(x) /x and cosh(x) are in Jt\. 

To show non-existence we use Corollary 121. 321 If e~ x e Jt"i then e~ D f e 
Ml, but this fails for f = 2x + 1. If e~ x2 e JfJ then e _D2 f e K x , and this fails for 
f = (2x + l) 2 + l. □ 
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21.6 Real stable polynomials in one variable 

The class "K\ shares many properties with P pos , as the first result shows. 
Lemma 21.37. 

1. I/f, g have real coefficients then f g e "K\ iffi and g axe in HCj. 

2. //f e JCi then all coefficients ofi have the same sign. 

3. Iff e "Ki then f Tev e "K\. 
Proof. The first is obvious. Factor 

where the Tj are negative, and the a\ have negative real part. Expanding 
(x — cr k )(s — ffk) = x 2 — 29l(ff k )x + |cr k | 2 shows that all the factors have positive 
coefficients, and so the coefficients of f are all positive. 

The roots of f rev are the inverses of the roots of f . Since the roots of f lie in 
the open left half plane, so do the roots of f rev . □ 

The converse of 2 is false: 

(x - 1 - 2x)(x - 1 + 2i)(x + 2 - i)(x + 2 + 1) = x 4 + 2x 3 + 2x 2 + 10x + 25 

There are simple conditions for the stability of polynomials of small de- 
gree. 

Lemma 21.38. Assume all these polynomials have real coefficients. 

1. A quadratic ax 2 + bx + c is stable if and only if all coefficients have the same 
sign. 

2. A cubic x 3 + ax 2 + bx + c is stable if and only if all coefficients are positive and 
ab > c. 

3. A quartic x 4 + ax 3 + bx 2 + cx + d is stable if and only if all coefficients are 
positive and abc > c 2 + a 2 d. 

Proof. We know all coefficients must be positive. The first one is trivial. For 
the remaining two, we convert to 3, and the interlacing condition becomes the 
inequalities. 

x 3 + ax 2 + bx + c is stable iff 
x 3 — bx + i(ax 2 — c) is in 3 iff 
x 3 — bx < ax 2 — c iff 
Vb > y/c/a which is equivalent to ba > c 
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x 4 + ax 3 + bx 2 + cx + d is stable iff 
x 4 — bx 2 + d + x(ax 3 — cx) is in 3 iff 
x 4 — bx 2 + d < ax 3 — cx iff 

X - (b - Vb 2 - 4d) < ^ < i (b + Vb 2 - 4d) iff 
c 2 + a 2 d < abc 



□ 



In ove variable F E is a sum of even powers, and f o is a sum of odd powers 
so we can refine the previous result. Recall that f e and f are the even and 
odd parts of f . 

Lemma 21.39. Assume that f is a polynomial of degree n with all positive coeffi- 
cients. 

f e < f o ti even 



f € Mi if and only if 



f e <C f o Tl Odd 



Proof. We first assume that the degree n of f e "K\ is even. If we write f (x) = 

f e (x 2 ) +xf D (x 2 ) then 

4>(f) =i n [f e (-x 2 )-ixf (-x 2 )] 

Since n is even f e (— x 2 ) <xf Q (— x 2 ) and hence f e (x) <f Q (x), which implies 
iMf) € 3 pos . The converse direction follows by reversing the argument. 
If n = 2m + 1 then 

4>(f) = (-u m (xf (-x 2 )+tf e (-x 2 )) 

which implies that xf ( — x 2 ) < f e ( — x 2 ) , and hence f e <C f . □ 

It follows from the lemma that if the degree of f e J£i is even then f e +tf e 
3. If the degree is odd, then (1 - 1) (f e + xf ) e X 

If x is a root of f e "K\ then f /(x — t) is not necessarily in "K\ . If we pair the 
complex roots then we get interlacing: 

Lemma 21.40. Suppose that f (x) e "K\, and write 

n m 

j=l k=l 



If all a\c, |3j are non-negative and 9t(ffk.) < then 

r H v~ a f(x) ^ X-9t(<T k ) 

* V — Tj ^ IX — (Ti. I IX — fTi. 



j=l J k=l 
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Proof. The conclusion follows from Lemma l21.28l and the observation that 

X-9t((T k ) 



(X- CTkKx- Oic) 2 U-(J k X-CT k 



□ 



Example 21.41. It is not easy to determine all polynomials interlacing a given 
polynomial. For example, we will find all linear polynomials interlacing (x + 
l) 2 by translating to 3. Assume that (x + l) 2 <ax + n. Then 

(x + l) 2 + ai(crx + T) e 3ii{C) 

for all real a. Since the sum of the roots is —2 — aicr and lies in the left half 
plane for all a it follows that 3(a) = 0. After scaling, we may assume that 

(x + l) 2 <x+a + bi. 
We now translate this interlacing to 3: 

x 2 - 1 + 2ix < x + ai - b 
The determinant condition is 



> 



x 2 - 1 2x 
x — b a 



(a-2)x 2 + 2xb - a 



Solving the equations yields 

1. (x + l) 2 <jx + t in JCi(C) if and only if t lies in a circle of radius one 
centered at 1. 

2. (x + l) 2 < x + 1 in Oil if and only < t < 2. 

If < a < 1 then we can give an explicit polynomial in JC2 that shows that 
(x + l) 2 <x+ a. 



y + x (a 2 + Va 2 + la + l) + 1 a 

-a y + x (a 2 - Va 2 + la + l) + 1 

= (1 + a 2 )(l + x) 2 + 2(1 + a 2 ) (x + — ^ )y + y 2 

We have seen the first kind of region in 3. Restricted to real polynomials, 
we get an interval in r K\, but just a single point in P pos . 
A similar calculation shows the following: 

(x + 1) (x + 2) < (x + 1) + a(x + 2) in Jfi iff a > -1/2 
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Iff = fl(x — Wi) € P then in Lemma fl .201 and Lemma fl. 231 we gave con- 
ditions on constants Ct such that Y_ c iV( x — w is m P- If f G P pos we now 
ask what choice of constants determines a stable interlacing polynomial. The 
example above shows that some constants can be negative. 

Lemma 21.42. Suppose that f = (x + T\) ■ ■ ■ (x + r n ) where < r\ < ■ ■ ■ < r n . If 

a.k + a k+ i H h Qn ^ 0/or fc=2,. . . ,n 

H V" f 

and af kasf one swm zs positive, then f < — > a k m Jti. 

^— x + r k 

Proof. We need to verify that the quotient 



1 v- f \- 



r k ^— x + r k 

maps JH(ct) > to the right half plane. Let a = <x + (3i with a > 0. Then, 
~ v- a k ~ v- r k + a - (3i 

*L^ = *L % r , +a)2+p2 

r k + a 



^ ak (r k + a) 2 + |3 2 
^ (Tn + a V + p^ afc(a + Tfc) 



This is positive since 



Y ak(a + r k ) = (a + ri)(ai H ha n ) + (r 2 -ri)(a 2 H h a n ) + 

(r 3 -r 2 )(a 3 + h a n ) H h (r n -r n _j)a n > 

by hypothesis. □ 

Example 21.43. If < ri < r 2 and we choose qi + a 2 > 0, a 2 > then f = 

(x + ti)(x + r 2 ) < ai(x + r 2 ) + a 2 (x + ri) in Jfi. Scaling so that qi + a 2 = 1, this 
is f <x + x\ + (r 2 — Ti)ai. Since 1 ^ ai, it follows that 

(x + ti)(x + r 2 ) < x + t in JCi if < t < r 2 

We can determine all t s K for which (x + ri) (x + r 2 ) < x + t in !Hi. Con- 
verting to 3 yields x 2 + (r\ + r 2 )xt — rir 2 < x + it. The interlacing conditions 
are satisfied, and the determinant condition yields 

(x + T!)(x + r 2 ) <x + tin Jfj iff < t < r a +t 2 

Although the example shows that the hypotheses of the lemma do not 
determine all interlacing polynomials, there is a sense in which they are the 
best possible. 
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Lemma 21.44. If qi, . . ., a n have the property that for all < r\ < ■ ■ ■ < r n we 

have 



f 

f < Y a k — mIKi 



k=l 

w/tere f = (x + n) • • • (x + r n ) then 

Qk + aic+i H h a n ^ 0/or fc=2,. . . ,n 

Proof. Assume that a k + • • • + a n < 0. Take r = r\ — ■ ■ ■ = r k _i and s = r k 
• • • = r n . Then the real part of the quotient, where cr = a + |3r, is equal to 

k-1 Tl 

^- r+a ^- s + a 



k-i 

r + a 



L, s + a v- 



|a + r|^ Icr+sl- i 

Since the second sum is negative this expression will be negative if we can 
choose constants so that 

r+a /s+a 

< e 



for appropriately small positive e. This is the case if we pick |cr| > s > r > 
a. □ 

The next lemma follows from the corresponding facts for JCi(C). 
Lemma 21.45. 7/f,g e Jtj f/zen f(D)g e I/f e P pos f/ien ffD^g G JCi. 

Combining the positive and negative cases, we get 
Corollary 21.46. Iff e P pos then 



f + af" e 



P a 

?fi a > 



We describe some interactions among pT° s , J£ 1/ JCi(C) and 3. 

Lemma 21.47. If F e 3 /zas aZZ coefficients in the first quadrant then Fe JCi(C). 

Proof. Write F = f + ig. If f = T~[(x — r k ) and g/f = Y_ where all a k are 
positive then (f + ig)(a) =0 implies 

Icr-nl 2 

It follows that the real part of cr lies in the interval (minr k ,maxr k ). If all the 
coefficients are positive then all roots of f are negative, so the real parts of a 
are negative, and F e "K\ (C). □ 
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The intersection of JCi(C) and 3 contains polynomials that do not have all 
positive coefficients. If f + ig is 

(x - .08)(x + .7)(x + l)(x + 1.7)(x + 2) + i(x + .3)(x + .9)(x + 1.7)(x + 1.8) 
then the roots are all are in the third quadrant 

(-1.9 - O.li, -1.7 - 0.003t, -0.9 - 0.04t, -0.5 - 0.5i, -0.2 - 0.2i) 

Lemma 21.48. Assume that f, g e P pos . Iff g in P then f ^- g in 

Proof. Iff ^ g then f + yg e Y\ os c K 2 , so f ^- g. □ 

The converse is false. 
Remark 21.49. The image of "K\ in 3 is easily described. If f = ^ n a k x k then 



*(f)=t-^x k a k (-ix) k 

= Y. x k a k (-l)<- k '/ 2 + t Y. x k a k (-l)(- k - 1 '/ 2 



Thus the real part consists of the terms of even degree with alternating signs 
and the odd part consists of terms of odd degree with alternating signs, or vice 
versa. This is equivalent to g(x) = g(— x). Since 4> is just a rotation, the roots of 
4>(f ) are all in the lower half plane, and are symmetric around the imaginary 
axis. Note that the image of Jti isn't 3 V ° S . 

Lemma 21.50. Iff e P pos , and T(x l ) = x^ 1 for i ^ 1 and T(l) = then 
j.ppos — 

Proof. Note that T(f) = (f(x) - f(0))/x. If f = Y\[x + at) where all en are 
positive then a is a root of T(f ) implies that f (a) = f (0), and therefore 



If x has positive real part then x/ch also has positive real part, so |1 + x/at| > 1. 
Consequently the product above can't be 1, and so no roots are in the right half 



n 



k=n (mod 2) 



k^n (mod 2) 




plane. 



□ 



21.7 Coefficient inequalities in "K\ 



Newton's inequalities do not hold for "K\. This can be seen easily in the 
quadratic case. If f = a + aix + a 2 x 2 e J£i then 
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^4 if the roots are real 



Q Q2 



^4 if the roots are complex 

Moreover, the quotient can take on any value in (0, oo). 

However, there are many inequalities involving the coefficients. For in- 
stance, if f = Y. a t xl <= 3^1 then we know that 

ao — <i2X H < ai — Q3X H 

and therefore | £j al I > which is equivalent to 



uoa 3 



> 1 



(21.7.1) 



A much more general result is that the Hurwitz matrix is totally positive. (See 
p. 16491 ) This means that 

\ 



( al 


a 3 


a s 


0.0 


a 2 


Q4 





ai 


a 3 





ao 


Q 2 


{'■■ 







/ 

is totally positive. For example, if we choose f g "K\ (4) then 

0€ 



ai a 3 
a a 2 a 4 
ai a 3 



oil Q2Q3 - Q1CL4 - cigao 



This inequality is the necessary and sufficient condition (Lemma 121 .38|> for a 
positive quartic to be stable. 

Here's a simple yet surprising conclusion about initial segments of stable 
polynomials. 

Lemma 21.51. IfY. ckx 1 e P pos then 

12 3 4 

y~ cLiX 1 ~ V" atx 1 ~ V" Qi.x T ~ V" atx 1 in Jfi 



k=0 k=0 k=0 

7« particular, each of these partial sums is stable. 



k=0 



Proof. We begin with the middle interlacing; the first is trivial. We need to 
verify that 

a(ao + aix+ Q 2 x 2 ) + (ao + aix+ a2X 2 + a 3 x 3 ) = (1 + a)(ao + aix+ a2X 2 ) + a 3 x 3 
is stable. From (|21.7.1|l we know that aja2 > aoa 3/ so 

(1 + oc)a\ (1 + oc)a2 > (1 + a)ao a 3 
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and therefore we have positive interlacing. For the last interlacing, we need 
to show that 

(1 + oc)(ao + aix + a2X 2 + CI3X 3 ) + CI4X 4 
is stable for positive a. To do this we need to show that 

(1 + oc) 3 aia2a3 > a 4 (l + a) 2 a 2 + (1 + a) 2 a 2 a 

The Newton inequalities give 

a\ ^ ^a a 2 a 2 ^ |CLia 3 a\ ^ ^a 2 Q4 

and as a consequence 

G2CL3 ^ 6aia4 qj Q2 ^ 6Q0Q3 

and so 

(1 + a) 3 aia2d3 ^ 6(1 + a) 2 a 2 a4 
(1 + a) 3 aia 2 a 3 ^ 6(1 + a) 2 a 2 a 

Addition yields the desired result. □ 

Note that the first two only require that f is stable, but the last interlacing 
usesfeP pos . 

Remark 21.52. It is not true that the higher partial sums are stable. For k > 5 
the partial sum 1 + x + ■ ■ • + x k /k! of the exponential series is not stable [187]. 
The sum of the first k+1 coefficients of (l+x/n) n converges to l+x+- • -+x k /k!, 
and (1 +x/n) n is in P pos . Thus, for n sufficiently large the partial sums aren't 
stable. For instance, the sum of the first six terms of (1 + x/n) n is stable for for 
n < 23 and not stable for n > 23. 

21.8 Positive interlacing 

We study positive interfacings f ~ g. If f, g in P, then one way of showing 
f < g is sign interlacing. This can be viewed as describing the behavior of the 
map 

— : K — > M\oo. 
f x 

In order to prove that f ~ g in "K\, we look at the image of the first quadrant 

rather than the real line. The next lemma gives criteria for £ and ~. The proof 
is omitted. 



Lemma 21.53. 
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1. figzjf Ch^RHP. 

2. i-giff^. d— C\(-oo,0). 

If f , g G P then the implication f < g =>• f ~ g is trivial. It's also easy in 
Lemma 21.54. Ifi, geJfj and f < g implies f ~ g. 

Proof. We know that the image of the closed right plane under xh -^M lies 
in the open right half plane. Thus, f ~ g. □ 

We have seen (See p.EU) that if f ~ g, H in P pos then g + h might not have 
all real roots. However, 

Lemma 21.55. Iff ~ g,h are in P pos then g + h is stable. 

Proof. From Lemma [21. 531 we know that evaluation of g/f and h/f at a point 
in the closed first quadrant gives a point in the right half plane, and so their 
sum lies there too. □ 

Lemma 21.56. Ifi g then f ~ x g 

Proof. We assume f, g € "K\; the case f, g € Jfi(C) is similar. Since f < g we 
know that f/g : Qi — ► RHP. To show f ~ xg we choose a ^ and prove that 
f + axg e J£i. We do this by establishing that f + axg ~ f . If cr e Qi then 

f+axg f(a) 
a = 1 + occt- 



f ; g(cr) 

Now (g/f)(ff) G RHP since f < g and ctcr e RHP so tkf^-(o) is in RHP + 
RHPx RHP = C\(0,oo). □ 

Next is a Leibniz-like rule for second derivatives. 

Lemma 21.57. Ifi, g e P pos fhen f g" + f "g e JCi. 

Proof. Recall Lemma 121.531 If we evaluate f/f" and g/g" at a point in the 
closed first quadrant then we get a point in the lower half plane. Thus f/f" + 
g/g" is stable. □ 

Note that f g" + f "g might not lie in P pos . For example, if f = (x + l) 2 and 
g = (x + 2) 2 then f g" + f "g is always positive, and so has no real zeros. 
Positive interfacings can be multiplied. 

Lemma 21.58. Ifi < g and h < k in P pos 
1. fh~ gkin Jf x . 



CHAPTER 21 . STABLE POLYNOMIALS 



640 



2. fk~ ghinMi. 

3. f 2 ~ g 2 in JCi. 

Proof. We know that f/g and h/k both map the first quadrant to itself, so 

(fh)/(gk) maps the first quadrant to the upper half plane, so f h. ~ gk. The last 
is a special case of the first. Since k/h maps Qi to Qiv, we see that (fh)/(gk) 
maps to Qiv U Qi U Qh \ 0, from which the second statement follows. □ 

Corollary 21.59. If qi, bt, ct > 0, then the sequence defined by 

P-i =0 
Po = 1 

Pn = (d n X + b n )p n _i +C n p n _ 2 

then all p n are stable, and 

H H 
..-P3 ~p 2 ~Pl 

Proof. From Corollary 121 .271 we know that Vn <Vn-i, and so p n ~ Pn-i by 
Lemma I2L541 □ 

The preceding lemma can be considered as a construction of stable orthog- 
onal polynomials. There is a simple case where the roots are all known explic- 
itly 1801. If fi = 1, f 2 = x and 

Ttv+iM =*fnM + fn-i(x) 

then the polynomials are known as the Fibonacci polynomials and have 
purely imaginary roots: 

2icos(k7t/n), l<k<n-l. 
Remark 21.60. If we take two stable polynomials 

f = x 4 + 6x 3 + 14x 2 + lOx + 4 g = x 2 + 2x + 2 

then how do we go about showing that f ~ g? If we use the definition then we 
need to show that f + t g e %\ for all positive t. Since this is a quartic we could 
apply the criterion (Lemma 121.381 for a quartic to be stable. A more general 
technique uses the fact (Lemma l21.53|l that f/g must map the closed first quad- 
rant to C \ (-oo,0]. If cr g Q~7 then this would follow from 3(f(cr)/g(cr)) > 0. 
To show this we just need to show that the polynomial 3(f (a + bi)g(a — hi)) 
is positive for a, b 0. If we are lucky, this polynomial will have no minus 
signs, as in this case: 



f (a + bi)g(a- bi) = 2b (a 2 + b 2 ) 3 x 
(a 5 + 6a 4 + 2b 2 a 3 + 16a 3 + 8b 2 a 2 + 27a 2 + b 4 a + 8b 2 a + 24a + 2b 4 + 3b 2 + 6) 
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This approach doesn't quite work here, since a = b = has imaginary part 0. 
However, f (0)/g (0) = 2 which is not on the negative axis, so we conclude that 



, H 

f ~ g. 



f(x) Uv) 
g(x) g(y) 



zs stable iff <—— g G pT° s 



The Bezout polynomial is stable. 
Lemma 21.61. The Bezout polynomial 
Proof. Recall (9233 

B(x,yj = > at 

^— x - n y - n 

B(x,y) = y a _L 1_ 

f(x)f(y) ^— l x-nD-Ti 

If f, g e P pos , cr e Qi, re Qi then 7/^7-7^7- is in the lower half plane, and so 
the sum misses (— 00, 0). If a e Qi, t e Q4 then ^ — j^- is in the right half 
plane, and so the sum again misses (—00, 0). The remaining cases are similar, 
so we conclude f(x)f (y) ~ B(x,y). 

□ 

When a polynomial is always positive then it is not necessarily stable. 
Here's a familiar case where it is. 

Lemma 21.62. Iff < g in P pos then f f , g 9 , is stable. Iff has all distinct roots then 
I f f , f ',, J is stable. 

Proof. If we write f = Y\i x + r i) an d g = Y. a iV ( x + T i) then 

f'g-fg' = f 2 V-^- 

and Lemma [21. 53l applies. Use f < f ' for the second one. □ 

Example 21.63. Here's a nice example of the lemma. Assume that fo, f 1, ^2, ■ ■ ■ 
is an orthogonal polynomial sequence with strict interlacing: f ^+1 < f k for k = 
1, The Christoffel-Darboux formula [168J states that 



f (x)f (y) + fi(x)fi(y) + • ■ • + f n (x)f n (y) 



for positive constants k n . If we let x = y then 



k n +i x - y 



fn(y) fu+l(lj] 

fnM f n+ l(x) 



f (x) 2 + f!(x) 2 



Ux) 



fn(x) frv+lM 

f;tx) f; +1 (x) 



It follows from the Lemma that 



f (*) 2 + f 1 M 2 + ■ • ■ + f n M 2 is stable 
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We can also state Lemma \2 1,621 in terms of Wronskians: if f < g in P pos 
then the Wronskian W(f, g) = f g' — f 'g is stable. 
More generally we have 

Lemma 21.64. The square determined by jl + xDi + yD 2 + ZD3I, where all Dt are 
positive definite, Di is diagonal, and D2, D3 have all positive entries, yields a stable 
polynomial. 

Proof. The assumptions mean that we are to show that | * 9 | is stable, where 

f + gy + hz + kyz + ■ ■ ■ = |I + xD a + yD 2 + zD 3 | 

and all matrices are positive definite. Assume that Di is diagonal (r-j.), D2 = 
(aij), D 3 = (btj). We have 



f 
9 



n( i+r ^ 

22 a u f/(l + n%) 



h = ^b,jf/(l+TjX) 



k = 



gh-fk = f 2 Y_ 



Qiib 



f 



(1+TiXHl + TjX) 



(1 



bit y- aijbij \ 

TiX) 2 (1+T t x)(l+Tjx) J 



Since Di,D2, D3 are positive definite, their diagonals are positive, so (gh — 
fk) / f 2 lies in the lower half plane by Lemma [21. 291 Lemma [2 1 .531 implies that 

f 2 ~ gh — fk, and therefore gh — f k is stable. 

□ 

Corollary 21.65. Iff, g, h, k are as above, and < a < 1 then is stable. 

Proof. The above computation shows that 

gh - afk = gh - fk + (1 - a)fk = f 2 (V " ibi + (1 - a)^ 

\*— (1 + T|X) 2 f 

From Lemma|2T33]we know that if we evaluate at a point in the first quadrant 
then all the terms lie in the lower half plane. □ 

We can slightly improve Lemma [21. 621 

Corollary 21.66. Iff e P pos and < a s$ 1 then \ f, /,' | is stable. 

Proof. Applying the calculations of the lemma to 

f (x + y + z) = f (x) + f'(x)u + f '(x)z + f "(x)yz/2 + • ■ • e P? 05 
shows that (a/2)ff " - f'f is stable. □ 
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Newton's inequality says that a certain determinant is positive. If we intro- 
duce a second variable, this becomes a statement that a polynomial is stable. 
The first part follows from Lemma [21. 641 



Lemma 21.67. If I = £ f i {x)y i is in Pf s then 



+2 
T 

fofi 

f2 

fofi 



H 

H 
H 
H 



f? - fof 2 
f? - fof 2 

X(f? - f f 2 ) 

x(f? - f f 2 ) 



in ?Ci 
in Jfi 
in Mi 
in Jfi 



7n particular, f \ — fof 2 is (weakly) stable, and all coefficients are non-negative. 

Proof. We compute i\ — f(jf 2 using the determinant representation for f. So, 
assume that f = |I + xDi + yDal- We may assume that Di is diagonal, Di = 



diag(ri), and that D 2 



Then 



f2 . 



1 



fo(x) 

f 2 (x) 

-fof 2 
-fof 2 



TiX 



V , fo 



d« 



L 

^ d "(l + xn) 2 



fo 



f2 
T 



nx)(i + Tjx) 

h^td^+dudjj) 



f2 



(1 +XTi)(l +XTj 



fZ 



XTi 



diidjjj 



x<3 



- (l + xri)(l-|-xrj 



Now the diagonal entries of a positive definite matrix are positive, so all the 
coefficients of the expansion of i\ — fof 2 are positive. Consequently, evaluation 
at a point in the closed first quadrant yields a point in the open lower half 
plane, which establishes the first part. 
For the second interlacing we have 



f2 . 
r l 



fnf 



0T 2 



fofi 




dfj + dudjj 



xr t )(l 




When we evaluate a point in the closed first quadrant, fo/fi is in the first 
quadrant, and the part in the parentheses is in the lower half plane. Conse- 
quently, their product lies in an open half plane P (the rotation of the lower 
half plane by the argument of the point) which misses the negative real axis. 

Since f \ — fof 2 is stable, all of its coefficients have the same sign. If we 
let x = then this is positive, by Newton's inequality for f(0,y). Since the 
constant term is positive, all terms are positive. 
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When we introduce the factor of x, then the sum in parentheses for 
x(f 2 — fo^l/'fo li es i n the right half plane by lemma 121.291 The case where 
the denominator is fofi yields a point in the product of the first quadrant and 
right half plane, and so again misses the negative real axis. 

□ 

If we choose (3 > then fo([3),fi(|3),f2((3) are the first three terms of a 
polynomial in P pos , so we know that fi((3) 2 > 2f (|3) f 2 (|3). This suggests that 
we can improve the result above. 

Corollary 21.68. If < a < 2 and f, ft are as above then 

1. f \ — afof2 is stable. 



2. 



'k + l 'k+2 
fk fk+1 



zs stable. 



Proof. For the second, we differentiate k times and apply the first part. Using 
the notation of the lemma, the first part follows from the identity 

f 2 - txfpfg _ 



. (l+xn) 2 f-r (l+nx)(l+T 5 x) Z - (l + nx)(l +tjx) 



□ 



We can use the lemma to construct stable polynomials from polynomials 
inP pos . 

Corollary 21.69. Ifi € P pos and a,b,c> then af 2 + bff + cx(ff" - (f) 2 ) z's 
stable. 

Proof. From Lemma |21.67| we know that each of the three terms positive inter- 
lace f 2 , so any positive combination is stable □ 

The next lemma arises from considerations involving the Euler polynomi- 
als. Unlike the last lemma, we must look carefully at the terms of the quotient. 

Lemma 21.70. //f € P pos (n) then 

Q = x (uf 2 + (x - l) 2 [ff ' + x(ff " - f ,2 )] JeM! 
Proof. We show that Q and f 2 interlace positively. First, 

Q/f 2 = x( n + (x - 1) (f '/f ) + xfff " - f ,2 1 /f 2 
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If f = T~[ [x + Ti) where all Tt are positive then 

cl2 f-cii 



f _ y 1 f 11 - ff " _ y \_ 

Thus the quotient satisfies 

Q/f 2 =x(u+(x-l)Y— -xV" r - — 

Z/ x — 1 X 
1 + -. 
V X + Ti (X + T t 



x — 1 X 

, , n) 2 

_ y XTj(l + Tj -fx] 

If we substitute <x + |3i for x then 

XTitl + Tj + x) _ 

rt(|3 4 + (2a 2 + 3r i ct+a + r i (r i + 2)) |3 2 + a(n + a) 2 ^ + a + 1)) 

In + a\ 2 

Thus, the sum is in the first quadrant if oc + fii is in the first quadrant. □ 



21.9 Linear transformations 

Linear transformations preserve linearity, so the following is immediate. 

Lemma 21.71. Ifl:0i-[ — ► "Ki preserves degree and the sign of the leading coeffi- 
cient then T preserves interlacing and positive interlacing. 

Lemma 21.72. If f, g £ 5fi then f * g e JCj. 

Proof. If f € pP° s then the method of generating functions shows that the 
Hadamard product maps P pos x Jfi — ► 3fj. However, we need a different 
approach when f ^ pf os . There does not appear to be a direct way to do this. 
We follow [66]: convert to Ui, take Hadamard products there, and use the fact 
that f < g and h< k implies f * h< g * k. 

□ 

Lemma 21.73. If J: x n — ► (x) n then T maps % x to % x and "Ki{C] to Jfi(C). 

Proof. It suffices to show that T maps "K\ (C) to itself; we proceed by induction. 
So assume that f e 5fi(C) and T(f ) e Jfi(C). If a € RHP then will show that 

T(x + a)f ~ T(f ) which implies that T(x + a)f e 5fi(C). From f77/fl we know 
that 

T(x + a)f = (x+a)T(f)+T(xf). 
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If cr e Qi then 

T(x + a)f , T(xn . 

Now T(xf') ~ T(f) so 4ffr(^ is in C\(0,oo). Thus T %" ]i (o) is in C\ (0,oo) 
and the lemma is proved. □ 

Lemma 21.74. If J: x n — > (x) n f/ien T^ 1 maps "Mi to"K\ and"Ki{C) toJCi(C). 

Proof. We proceed as above. Let S = T _1 . From | |7.6.1| | we have 

S(x + a)f = (x+a)S(f) +x(S(f))' 

ToshowS(x+a)f ~S(f) we choose cr e Qi and then 

Since S(f) E 5Ci(C) by induction and S(f) <(S(f)) ' we have (S(f))'/S(f) (cr) € 
RHP and therefor dlTO e RHP + RHP x RHP = C \ (0,oo). □ 

{-.TV/2 p7)pfl 
thenl-.'Kx — >P vos . 
nodd 

Proof. If f e 5fi has degree n and we write f (x) = f e (x 2 ) + xf D (x 2 ) then 

i n (f e (-x 2 ) + (-ix)f (-x 2 )) eU 

Thus, f e and f are in P, and so T(f ) = f e since all coefficients are positive. □ 

The Charlier polynomials are defined by 

=0 C a = 1 
C£ +1 (x) = (x - n - a)C«(x) - anC*_,(x) 

Lemma 21.76. If a e RHP and T: x n i-> C£ then J- 1 maps'K 1 {C] to ^(C), and 
J£i to 3xi • 

Proof. We proceed as with the rising and falling factorials. We show by induc- 
tion that for a e RHP we have that T _1 (x + a)f ~ T _1 (f ). From the recurrence 

T- 1 (x + a)f = (x + a + ajT^f + (x + a) (T" 1 (f )) 
we choose cr e RHP and consider the quotient 

T _ 1(f) (g) ^(g+a+a) + (cr+a) V T _ ltf) ; (a) 

We see that the quotient lies in RHP + RHP x RHP C C \ (0, oo) since a, a and 
a are all in RHP. □ 
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If we have two mutually interlacing sequences then the sum of products 
in reverse order is in P (See p. [66]). If we use the original order we get a stable 
polynomial. 

Lemma 21.77. Suppose that fj, T2, • • • , f n and g\, g2, • • • , g n ore two sequences of 
mutually interlacing polynomials in p pos . Then 

figi + ^2 h h fngn e "K\ 

Proof. If 1 < k ^ n then f i <C f k and gi <|C g^. If cr is in the closed first quadrant 
then (cr) and ^ (cr) are in the open fourth quadrant. Thus (a) are in the 
open lower half plane. Addition shows that 



1 

i 

k=l 

which finishes the proof. □ 



1 

7 / fkflk is in the open lower half plane 

f i9i t—i 



We can apply this lemma to integrate families of interlacing interlacing 
polynomials. We also view these results as linear transformations on P sep . 



Lemma 21.78. If f, g G P sep then 



f (x + t)g(x + t) dt is stable. 

o 



Proof. The proof is the same as Lemma 18.601 except we use the preceding 
lemma instead of Lemma |3.16l □ 



Corollary 21.79. Iff e P sep then 



e l f (x + t) dt is stable. 



Proof. Apply the lemma to e x and f, and then factor out e x from the result. □ 

Example 21.80. If we take f e P sep then we see that \\ f (x + 1) 2 dt is stable. In 
particular, 

r i 

((x) n ) dt is stable. 

o 



Lemma 21.81. //f e Mi then 



f(x + t) dte J£ a . 

o 



Proof. Suppose deg(f) = n, let g(x) = i n f(— ix), and h(x) = J f(x + t) dt. 
Since g(x) € 3 we have from Lemma[2U39] 



■l 

g (x + it) dt € 3 and therefore 

o 



■l 

i n f(-ix + t) dt e 3. 

o 



The last integral is i n h(— ix), so h e "K\. □ 

It is surprising that we do not need to make any other assumptions about 
f in order that the difference is stable. 
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Corollary 21.82. Iff e "K x then A(f ) = f (x + 1) - f (x) e "K x . 
Proof. Since f ' € JCi we have 



f(x + l)-f(x) = 



f(x + t)'dt e Mi. 



□ 



Example 21.83. Here's a simple example. We know that x n e "K\, and the 
roots of Ax n = (x + l) n — x n are 



1 sin^ 
"2 _l -2 + 2cos2# 



k= l,...,u 



They all lie in the open left half plane, so Ax n 6 Jfi . 



21.10 The totally stable matrix conjecture 

Recall that if f (x) = Y. ^ is 1x1 pP ° S then the matrix 



/a ai a 2 a 3 a 4 a 5 a 6 

ao ai a 2 Q3 a 4 as 

ao ai Ol2 a 3 a 4 

ao ai a 2 a 3 



V : 



(21.10.1) 



is totally positive. This means that all minors are non-negative. Now, assume 
that f (x,u) = Y. fi( x )y x an d form the matrix 



cp(f)(x,y) 



,pos 
- 2 





/fo 


fl 


h h U is 


h 


...\ 







fo 


fl f2 h f4 


f 5 












fo fl f2 f3 


f 4 












f fl f 2 


f 3 






{'■■ 










a 


> 


we 


know that f(a, y 


) G 


ppo 



(21.10.2) 



<p(f)(<x,u) are positive. Now, whenever we have a polynomial that is posi- 
tive for x > we should consider whether it is stable. Empirical evidence 
suggests the following: 

Conjecture 21.84 (The totally stable matrix conjecture). If f (x,u ) e F^ s then all 
minors of the matrix J21.10.21l are stable. 

Here's a simple consequence: 

Lemma 21.85. If f € Pj 08 and all minors of cp(f) are stable, then all minors have 
non-negative coefficients. 
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Proof. If g(x) is a minor then since g is stable we know that all coefficients 
have the same sign. If we set x = a ^ then the matrix cp(f (a,y)) is just 
the matrix of a one variable polynomial in P pos and we know that all of its 
minors are non-negative. Thus, g(<x) is non-negative for all positive a. Since 
all coefficients are the same sign they are positive. □ 

Example 21.86. For a simple example, consider cp(l + x + u) 4 = 



/(1 + x 




V '■ 



i 4 4(1 +x) 3 6(1 

(1+x) 4 4(1- 

(1- 




x) 2 4(1 +x) 1 

■x) 3 6(1 +x) 2 4(1 + x) 1 

+ x) 4 4(1 +x) 3 6(1 + x) 2 4(1 + x) 

(1 + x) 4 4(1 + x) 3 6(1 +x) 2 



•7 



Every minor is a power of (1+x) times a minor of the coefficients, which we 
know to be positive, so all minors are stable. 



Example 21.87. Lemma [21.67l shows that the 2 by 2 determinants 
are stable. 



fk+1 fk + 2 



Example 21.88. If f(x) e P pos then f(x + y) e P^ 08 , and the entries of the 
matrix come from the Taylor series. 



cp(f(x + a)) = 



We know that some of the two by two determinants are stable. For example, 

1 A , 





fin 


f< 2 '/2! 


f 


3 V3! 


f(4 


/4! 


f(S 


/5! 


f (6) /6! 


...\ 





f 


fW/l! 


f 


2 '/2! 


f(3 


/3! 


f(4 


/4! 


f' 5) /5! 










f 


f 




f(2 


/2! 


f(3 


/3! 


f(4) /4 j 















f 


f(l 


IV. 


f(2 


/2! 


f< 3 '/3! 




V 




















•J 



f(3)/3! f (4) /4 , 
f(4) /4 J f (5) /5 . 



4!4!^5 99 



g'g') 



where g = f' 3 '. Since 4/5 < 1 Corollary |21.66| applies to show that it is stable. 

Remark 21.89. If f = Y. a i xX i s a stable polynomial then f e <f Q . Proposi- 
tion |3]22] therefore implies that 





Q 2 


d4 






\ 





Ql 


a 3 


Q5 














Q 2 


a 4 • • • 













ai 


a 3 a 5 





V 



is totally positive. This matrix is known as the Hurwitz matrix [7J. 
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Earlier (See p. 13001 ) we saw that the matrix of coefficients of a polynomial 
in Pj 05 is totally positive2, but there are examples where some three by three 
determinants are negative. However, there is a surprising conjecture that is 
not fazed by this fact. 

Conjecture 21.90. If Di, D2, D3 are positive definite matrices and M z's the matrix 
of coefficients of y,z of |I + xDi + y D2 + ZD3I then all minors of M are stable 
polynomials in x. 

For example, if we use the same example with a particular choice for D 1 



A 0\ /0 0\ /13 9 7\ /5 7 8\ 
M= 1 +x 1 +y 9 7 5 +z 7 11 12 
\0 1/ \0 0/ ^7 5 4/ \8 12 Uj 

then the coefficient matrix is a 4 by 4 matrix, and the non-constant 3 by 3 
minors are 

-1620x 3 - 7560x 2 - 4884x + 1760 -1080x 2 - 3512x - 4544 -768x-4096 
-1080x 2 - 3496x - 3376 -720x - 1312 

-744x - 2728 

All entries are stable polynomials. Note that if we substitute zero for x we get 
the example on page 13001 



21.11 Determinants of Hankel matrices 



If Pi/P2/ • • • is a sequence of polynomials then the d by d Hankel matrix (See 
p. 12941 ) of this sequence is 



H({pi};d) = 



/Pi 


P2 


P3 


■■ Pd \ 


P2 


P3 


P4 


•• Pd+1 


P3 


P4 


P5 


•■ Pd+2 



P2d / 



\Pd Pd+1 Pd+2 

In this section we study properties of Hankel matrices. This includes the kind 
of polynomial (all real roots, stable, etc.), the sign of the coefficients, and in- 
terlacing properties. 

A polynomial sequence {p n (")}n^o is called q-log-concave if all coefficients 

of 



VnVn+2 — Pn+1 



= H[{p n ,p n+1 ,p n+2 };2] 



Pn Pn+1 
Pn+1 Pn+2 

are positive for n 0. If all coefficients are negative it is q-log-convex 11181 . 
A stronger condition is that the polynomial p n+1 — p n Pn+2 is stable. For 
instance, Lemma [21.671 shows that the coefficients of a polynomial in are 
q-log-convex. 
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Lemma 21.91. IfB n is the Bell polynomial then B n+ 2B n — B^ +1 is weakly stable 
with positive leading coefficient. In particular, it is q-log-convex. 

Proof. The Bell polynomials satisfy the recurrence B n+ i = x(B n + B' n ), and 
therefore 

B n +2B n — B 2 , i = x ( B 2 + B n B' + x(B n B" — B T 



Since B n e P pos it follows from Corollary I2L691 that this is in !Ki . □ 

In addition, we have shown that B^ ~ B n B n+2 — B^ +1 . 

Lemma 21.92. If A n is the Eider polynomial then A n+2 A n — A^ +1 is weakly stable 
with positive coefficients. 

Proof. The Euler polynomial satisfies the recurrence 

A n+1 =x((n + l)A n + (l-x)Aj l ) 
and so substitution yields 

A n+2 A n - A 2 +1 = x (jl + n)A 2 n + (x - l) 2 [A n A^ + x(A n A^ - (A;) 2 ] 

This sum is the same as the sum in Lemma [21.701 except that it has n + 1 in 
place of n. This only adds a point in the first quadrant to the quotient, so we 
conclude that A n+ 2A n — A^ +1 is stable. The value at is known to be positive 
1118 j, so all terms are positive. Since A n has for a root, we only get weakly 
stable. □ 

The following general result is easy, since everything factors. The work is 
finding the constant. 

Lemma 21.93. Ifn and d are positive then |H[{(x) n , (x) n+1 , . . . }; d]| is in P. 
Proof. We first prove the result for d = 3. If u = 1 and d = 3 then the matrix is 

x x(x + l) x(x + l)(x + 2) 

x(x + l) x(x + l)(x + 2) x(x + l)(x + 2)(x + 3) 

x(x + l)(x + 2) x(x + l)(x + 2)(x + 3) x(x + l)(x + 2)(x + 3)(x + 4) 

with determinant 

2x 3 (x + l) 2 (x + 2) 

If we let f n (x) be the determinant of H[{(x) n , (x) n+1 , . . .}; 3] then it is easy to 
see that 

frvM = <x) n (x) n+1 (x) n+2 x 

11 1 

x+n x+n+1 x+n+2 

> + n)(x + n + l) (x + n+ l)(x + n + 2) (x + n + 2) (x + n + 3) 

= 2(x) n (x) n+1 (x) n+2 



CHAPTER 21 . STABLE POLYNOMIALS 



652 



In general, the answer is 

(d-l)!fd-2)!---l! 



'n+l 



'n.+ d-l 



It's easy to find the polynomial factors. If we remove them and replace x + n 
by y we are left with 



( u + d-l ), 



From II108II we see that this equals the Vandermonde determinant 



y + d-l 



(y + d-l) d - 1 



which equals 

II (Ey+j]-Ey + i])= n (j-^ 

0<i<j<d 0<i<j<d 

and this is precisely (d — 1)! • ■ ■ 1!. 



□ 



Example 21.94. The Chebyshev T polynomials have trivial Hankel determi- 
nants. First of all, 



T n T n+ i 
T n +i T n+ 2 



T n T n+ i 
2xT n - T n _i 2xT n+1 - T-n 



T n -i T n 
Tn T n+ i 



Continuing, we find they are all equal to x 2 — 1 . Thus, all the Hankel matrices 
H[{T n }; 2] are equal to x 2 — 1, so they all have all real roots. 

Since T n+ i = 2xT n — T n _i has coefficients that do not depend on n, all the 
higher Hankel determinants are zero. 

We end this section with some empirical results about the Hankel determi- 
nants of the form H({p n }; d]. "cpx" means roots in all quadrants, and / means 
it has been proved. 
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family 


a 


Kind of polynomial 


Sign of coefficients 


Bell 


2 


stable / 


positive / 




5*3 


cpx 


positive 


Euler 


2 


stable / 


positive / 




> 3 


cpx 


positive 


Narayana 


2 


stable 


positive 




3 


stable 


positive 




>3 


cpx 


positive 


Laguerre 


2 


stable 


negative 




3 


stable 


negative 




4 


stable 


positive 




n 


stable 





21.12 Constructing stable polynomials 

We construct matrices whose determinant is a stable polynomial. Recall that 
a matrix A is called skew-symmetric if A T = —A. If A is skew-symmetric then 

1. The eigenvalues of A are purely imaginary. 

2. If D is positive definite, then the eigenvalues of A — D have negative real 
part. 

Using the latter property, and the fact that f(x) e Jti if and only if 
i n f (— rx ) e 3(n] we have we have 

Lemma 21.95. If Di, ... are positive definite, S is symmetric, and A is skew- 
symmetric then 

1. |l + xiDi + ---+x d D d +iS| e 5£i(C) 

2. |l + xDi + --- + x d D d + A| e "Kd 
For example, the determinant of 

(o l) +X G 2) + (-l o) 

is in J£ 2 and equals 

2x 2 + 8yx + 3x + 5y 2 + 5y + 2. 
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Remark 21.96. It's easy to construct a determinant representation ct|I + xD + A| 
of f € JCi, where a > 0. Factoring f, it suffices to show this for polynomials of 
degree one and two - the general case follows by taking direct sums. 



x + a = a|(l)+x(l/a) + (0)| 



(x + a) 2 + b 2 = a 2 



1 
1 



+ x 



1/a 
1/a 



+ 



b/a 
-b/a 



Example 21.97. A special case of the matrix xl + D + A is the Jacobi-like matrix 



( x + ai bi 

— bi x+ai si 

-si 







V i ; : 



x + ai b2 

— b2 X + Q2 







S2 



... \ 



-S2 



X + a 3 b 3 







-b 3 



x+ a 3 



whose determinant is stable. The determinant of the submatrix consisting of 
the first n rows and columns is an n'th degree polynomial p n . The recurrence 
relations for p n are 

P2k = (X + die] P2k-1 + b 2 . p 2 k-2 
P2k+1 = (X + Q k +1 ) P2k + S 2 . p 2 k-l 



If all the Qi are positive then we know from the Lemma above that all p^ are 
stable ; this also follows from the recurrence of Corollary l21.27l Moreover, the 
corollary shows that they interlace. 

If a polynomial f (x) is in P pos then all its roots are negative, so f (x 2 ) has 
purely imaginary roots. There is a similar result for P!? 05 '. 

Lemma 21.98. I/f(x,y) G F p 2 ° s theni(x,x 2 ) G 5C X . 

Proof. Since f has real coefficients it suffices to show that f (cr, cr 2 ) ^ where 
cr is in the first quadrant. If so, then a 2 is in the upper half plane, and hence 
f(cr, a 2 ) ^ since f G U 2 . 

Alternatively, this is a consequence of the next lemma. □ 

Lemma 21.99. If Di and D2 are positive definite then |I + xDj + x 2 D2| is stable. 

Proof. If cr = a + bi is in the first quadrant then 

|I + ffDi + cr 2 D 2 | = |(I + aDi + (a 2 -b 2 )D 2 ) + i(bD a + 2abD 2 ) | 

The coefficient of 1 is positive definite, so the determinant is non-zero. 

□ 



CHAPTER 21. STABLE POLYNOMIALS 



655 



21.13 Stable matrix polynomials 

A hyperbolic matrix polynomial M is determined by the properties of the fam- 
ily of polynomials v* Mv. If we posit that all the quadratic forms are stable we 
get stable matrix polynomials. 

Definition 21.100. An n by n matrix polynomial M is stable if v t Mv is stable 
for all non-zero v e K n , and the leading coefficient is positive definite. 

Most properties of hyperbolic matrix polynomials carry over to stable ma- 
trix polynomials. For instance, we define 

Mi M 2 iff v*Miv v*M 2 v Vv e K n \ 

Mi ~ M 2 iff v*Miv ~ v*M 2 v Vv e K n \ 

The following lemma is easily proved using properties of stable polyno- 
mials. 

Lemma 21.101. Suppose that f, g, h are stable matrix polynomials, and k e JCi. 

1. All elements of Hyp\ os are stable matrix polynomials. 

2. f < — g ifff + erg is a stable matrix polynomial for all a e RHP. 

3. Iff g and f h then f g + h. In particular, g + h is a stable matrix 



polynomial. 

Iff ^ g <- 
5. fk is a stable matrix polynomial. 



4. Iff *iL g h then f + h^-g. 



6. If all at are positive then (aixi + • • • a^x^ + b)f <-^- f. 



7. If the i-th diagonal element off is f ir and o/g is gt then f <§C g implies ft <C gi- 

dx 



8. is a stoWe matrix polynomial. 



Q r h df 
^- 1 dx 

20. //f^gfen£^^ 

We can construct stable matrix polynomials using positive interlacing. We 
start with the data 

positive real numbers ai, bi, . . . , a n , b n 
positive definite matrices mi, • • • , m n 

We claim that 

n 

M(x)^m t n(x+ Qi ) 2 +b? 

k=l i^k 
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is a stable matrix polynomial. In order to verify this we first define 

g(x)=n( x + a ^ 2 + b ? 

i 

and we know that 
Next, 

v t M v = ^(v t m lc v)-— 

^- (x + a k ) 2 + b 2 

Since v t rakv is positive, it follows that g v'Mv since each of the terms 
positively interlaces g(x), and positive interlacing is closed under positive lin- 
ear combinations. 

More generally, we can also construct stable matrix polynomials using 
common interlacing. 

Lemma 21.102. If fj, . . ., f T have a common interlacing in "K\ and mi, . . . , m r are 

positive definite matrices then 

f i mi + • • • + f T m r is a stable matrix polynomial. 

Proof. The leading coefficient is a positive linear combination of positive defi- 
nite matrices and so is positive definite. If g < — f| then 

v*gv = (v t v)g ( v * nu v)f| = v t fi.rn.iV 

and therefore 

v t g v < — v 1 (f imi + • • • + f T m T ) v. 

□ 

A quadratic polynomial is stable if all its coefficients are positive. The 
analogous result holds for matrix polynomials. 

Lemma 21.103. If Mi, M 2 , M 3 are positive definite matrices then the matrix poly- 
nomial Mi + x M 2 + x 2 M 3 is a stable matrix polynomial, and its determinant is 
stable. 

Proof. We must verify that v l ( Mi + x M 2 + x 2 M 3 ) v is stable for all non-zero v e 
Now all the coefficients of (v t M 1 v) + x(v t M 2 v) + x 2 (v t M 3 v) are positive 
since the M^ are positive definite, and , so this is a stable polynomial. 

If A is a root of | Mi + x M 2 + x 2 M 3 1 then A ^ since Mi has positive 
determinant. There is a non-zero vector v in C H such that 

(Mi +AM 2 + A 2 M 3 )v = 
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Recall that if v* is the conjugate transpose, then v* M v > for non-zero v and 
positive definite M. Multiplying on the left by v* yields 

(v* Miv) +A(v* M 2 v) +A 2 (v* M 3 v) =0 

Since this is a quadratic with positive coefficients, it is a stable polynomial, 
and so A lies in the left half plane. □ 

Surprisingly, powers such as (xl+ A) d are generally not stable matrix poly- 
nomials. We do have 

Lemma 21.104. If A is positive definite, and B is positive semi-definite then (xl + 
A) 2 + B is a stable matrix polynomial, and generally not hyperbolic. 

Proof. First take B = 0. If v € K.^ is non-zero then all coefficients of 

x 2 (v* v) + x(2v t A v) + (v* A 2 v) 

are positive, so (xl + A) 2 is a stable matrix polynomial. Next, we show that 
the discriminant 4( (v t A v) 2 — (v* v) (v* A 2 v) ) is never positive. Define an inner 
product < v, w >= v l w. Then 

(v* Av) 2 - (v t v)(v t A 2 v) =< v, Av > 2 - < v,v >< Av, Av > < 

by the Cauchy-Schwartz inequality. If B is positive semi-definite then the dis- 
criminant is 

4((v t Av) 2 - (v^Mv 1 A 2 v)) -4(v t v)(v t Bv) 
which is still non-positive. □ 

There is a similar result for cubics, but we need a condition on the eigen- 
values to get stability. 

Lemma 21.105. Suppose A is a positive definite matrix with minimum and maxi- 
mum eigenvalues A m i. n and A max . If ^ max < 9 then (xl + A) 3 is a stable matrix 
polynomial. 

Proof. We need to show that if v is any non-zero vector then 

x 3 (v'v) +x 2 (3v* Av) +x(3v l A 2 v) + (v* A 3 v) 

is stable. Since all the coefficients are positive, by Lemma [2L38] this is the case 
precisely when 

{3v l Av)[3v t A 2 v) 



(v*v)(v* A 3 v) 



^ 1 



Now this is equivalent to 



(v^v) (v*A 2 v) 1 
x — — = — - > 



(vtv) (v t A 3 v) 9 

The first factor lies in [A m t n , A mQX ], and the second lies in [1/A max , 1/A m t n ], 
so their product lies in ^ min , ^ max . By hypothesis A m i n /A max ^ 1/9, so the 
proof is complete. □ 
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The ratio A mQX /A m t n is known as the condition number of A, so we could 
restate the hypothesis to say that the condition number is at most 9. 



Chapter 

22 



Transformations in the complex 
plane 



In this chapter we are interested in polynomials whose roots might be com- 
plex. If § is a region in the complex plane, then P s means all polynomials 
whose roots all lie in S. We will give some examples of linear transformations 
T and regions I, J such that T: P 1 — ► P J . 

It is difficult to describe the behavior of a transformation on all polyno- 
mials, so we sometimes restrict ourselves to a special subset of polynomials. 
Define the function 4> : C i— > P (n) by z i— > (x — z) n . Given a transformation T 
we are interested in determining the image T 4>- In the best case we can find a 
a simple function <$>' so that the diagram commutes: 

cb: zi-^ ( x— z) n n 

C — — -P c (n) (22.0.1) 

*' 



Given a set 7 of polynomials we are also interested in all the roots of poly- 
nomials in 7. We extend the meaning of roots(f) to a set of polynomials. 
Define 

roots(T) ={Ce C I 3f e TAf(0 =0} 

If S is a region in C, and T is a transformation, then since 4>§ c P s we know 
that 

roots(T(cj>S)) c roots(T(P s ) ). (22.0.2) 

We are interested in situations when 1 122.0. 2|l is an equality. See Question 12051 
We generalize the Hermite-Biehler theorem that characterizes polynomials 
whose roots are in the lower half plane in terms of interlacing properties of 
their real and imaginary parts. 
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The closed disk of radius r centered at cr is A(cr,r); the unit disk is A, and 
the boundary of the unit disk is the unit circle 3A. 

22.1 The derivative and interlacing 

We recall the Gauss-Lucas theorem and prove a converse. 

The fundamental result about the derivative is the the Gauss-Lucas theo- 
rem which states that the roots of the derivative of f lie in the convex hull of 
the roots of f . It follows that 

Theorem 22.1. If § is a convex subset ofC then the linear transformation f i— > f ' 
maps P s to itself. 

The converse is also true. 

Proposition 22.2. If T is a linear transformation with the property that T: P s — ► 
P s for all convex regions S, then Tf is a multiple of some derivative. 

Proof. We first choose S to be the single point 0. It follows that T(x n ) = a n x en , 
since T(x n ) can only have as a root. Ignoring the trivial case, there is some 
non-zero a^. Let s be the smallest integer so that a s ^ 0. Next, consider 
§ = {-1}. Since (x + l) s e P s and T(x + l) s = a s x e % it follows that e s = 0. 

Consider S = {—A}. The only members of P s are multiples of powers of 
(x + A). We compute T(x + A) r in two ways: 

T(x + A) r = |3 r (x + AP' = (3 T ^ mr WA"^ 



j=o x ' 

eQt ( ,'»~ - !(;)«»• 

k=0 v 7 k=0 v ' 



■ r-k 



Since these are identities in A that hold for infinitely many values of A, the 
coefficients of powers of A must be equal. From consideration of the coefficient 
of A T ~ k we find 

r ^a k x ek =|3 r ( mr ) x ™r-r+k (22.1.1) 



v k / \m r — r + k 

Choosing k = s shows that (3 T ^ for r s. Consequently, ^ for 
k ^ s. Substituting r = k shows that (3 T = a r . If we take r ^ s then comparing 
exponents shows that = m r — r + k. It follows from this that = k — c 
for some constant c, and k > s. Since we know that e s = 0, it follows that 
T(x k ) = ai<x k ~ s for k s and is for k < s. In addition, m r = r — s. 
Substituting this into 422,1.11 yields 

r > s 



s! (t-s)1 

r < s 



which implies that T(x r ) = a s D s x r , as promised. 

□ 
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Every point of § is a root of a derivative of a polynomial supported on the 
boundary 

Lemma 22.3. If § is a convex region ofC, and D is the derivative, then 

roots( D(P a$ )) =§. 

Proof. If ff,T e 3§ thenf(x) = (x-ct)(x-t) e P as . Since f = 2x- (u + T),we 
see that |(ff + x) S roots(DP as ). Since S is convex, the set of all midpoints of 
pairs of vertices on the boundary equals all of §. 

□ 

22.2 Properties of algebraic Interlacing 

In this section we consider some questions about the location of the roots of 
algebraic interlacing polynomials. 

Definition 22.4. Supposef(z) = cr(z — ri) • ■ • (z — r n ). We say thatf A g if and 
only if we can write 

gW=fW (-»_ + ... + -*_) 

\x-vi x-rxj 

where pi, . . . , p n are all real and non-negative. 

It is immediate from this definition that if f , g £ P then f < g iff f A g. For 
any f we have that f A f since we know 

f( X ) = f(x) (— l — + ... + — LS) . 

\x — ti x — r n J 

The Gauss-Lucas property holds for this definition of interlacing, and the 
proof is nearly the same, see 111211 Theorem 6.1']. 

Lemma 22.5. Suppose that § is a convex region, and f e P dS . Ifi A g then g e P s . 

We now extend the Gauss-Lucas Theorem by showing that every possible 
point is a root of some interlacing polynomial. Let Hull(£,i, ■ ■ ■ / Cn) denote the 
convex hull of the points (d, . . ., Ca)- If these points are the roots of f, then we 
write Hull(f ) = HuU(d, 

Proposition 22.6. Ifi is any polynomial, then every point in Hull(f) is a root of 
some g where f A g. Consequently, 

Hull(f) = roots{{g |f < g}) 

Proof, lemma 12231 shows that roots({g | f < g}) C Hull(f). Let C = 
Ki,...,Cn} be the roots of f. We first show that if e Hull(f) then G 
roots ({g | f ^ g}). For |3 = (|3i,. . ., |3 n ) we define 

n 

f p = 7" (3k— ^ (22.2.1) 
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The requirement that f p (0) =0 implies that 



= ±Cl • • • Cn 



Pi , Pr 



Thus, it suffices to see that is in the convex hull of . . . , 1/C n }, which is 
clear, since is in the convex hull of {Ci, • . • , C n }- 

Choose a in the convex hull of f . Define the Blaschke (see last section) 
transformation 

___ z— a 
I ! z I ► 

.\ z 

Consider f p (z) and the transformed polynomial g p (z) = (1 — az) n f p (T z). If 
|3 is chosen so that gp(0) = then fp(a) = 0. So, we need to know that if 
a is in the convex hull of the U then is in the convex hull of T£|. But since 
a e Hull(f ) there are Yi so that 

and so 

= T(a)=^^ T ( z ^ 
Thus, is in the hull, and we are done. □ 

This last proposition showed that there are no restrictions on the locations 
of roots of polynomials interlacing a given polynomial. However, there are 
restrictions on the simultaneous location of all the roots of a given polynomial 
interlacing f. Write f (x) = (x — rj) • • • (x — r n ), and define 

fp (Z = > Pi 

* — Z — Ti 

Letting si, . . . , s n _i be the roots of f p, and substituting z = r k we have 

and since Pk is positive we have 

Y_ arg(r k - s t ) = ar g( r k - Tj ) 
i j^k 

This says that the sum of the angles (mod 2n) formed at Tk by the roots of f p 
is a fixed quantity. We can therefore derive restrictions such as 

The roots of f p can not satisfy 
sC arg(r k - sO < — — Y_ ar g( r i ~ r k) for 1 s$ i n - 1 
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For example, if n = 3 this restriction becomes: 

The angles at a root of f formed by the two roots can not both be 
less than half the angle at the vertex. 

In the picture below, it follows that f p has one root in each triangle marked a 
(or one each in b, or one each in c). The interior lines are the angle bisectors. 




Figure 22.1: Where the roots are. 



22.3 The map cp 

When we have a formula for T(x — z) n we can use this information to deter- 
mine a map 4>' so that (|22.0.11 is satisfied. We begin with multiplier transfor- 
mations. 

Example 22.7. Multiplier transformations have a very simple action in the 
complex plane. Suppose that T is a multiplier transformation. We have a 
fundamental commutativity diagram that is immediate from the definition of 
multiplier: 



Lemma 22.8. Suppose that T is a multiplier transformation x k <ikX k and 
n, . . . , r n are the roots of T(x — l) n . The definition of§' below makes 1 122.0. 1|| com- 
mute. 

k 

Proof. IfzeC, then T(x - z) n (x) = implies that T(| - l) n (x) = 0. Since 
T is a multiplier transformation this is equivalent to T(x — l) n (^) =0, which 
implies the result. □ 

Example 22.9. If we use the formula 117.10. 4|l for Laguerre polynomials with 
the transformation T(x k ) = L^[x) we get 

T(x + yr = (y + irT(x-H-^-) (22.3.1) 



y + i/ y + i 
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and by linearity yields 

If we substitute ot[y + 1) for x in ( 122.3. lb then we see that 

T(x + y) n (a(y + l)) = [y + l) n T[x n ){oc) = [y + l) n L n [oc) 

and that therefore the roots of T ( ( x — y ) n ) are given by a ( 1 — y ) , where a is a 
root of L n . In particular, it follows that 

Lemma 22.10. Suppose T(x n ) = L n [x) has roots r\, .. .,r n . The following defini- 
tion of 4>' makes 1 122.0. lb commute: 

n 

4>' -.z^Ylix-nii + z)) 

k=l 

For instance, if a is positive then the roots of T(x + cu) n lie on vertical rays 
emanating from the roots of L n . 

Example 22.11. We use the addition formula j22.4.2b for Hermite polynomials 
to determine 4>'. 

Lemma 22.12. SupposeT(x n ) — H n (x) has roots T\, .. .,r n . The following defini- 
tion of 4>' makes 1 122.0. It commute: 

n 

4>':*.->n(x-(T t +(l/2)(l+z))) 

k=l 

Proof. From the addition formula d22.4.2|) with f = (x + l) n we have 
T(x + l) n | . =T,(x + 2y + l) n 

If we let z = 2y + 1 then 

T(x + l) n (x-l/2 + z/2) =T,(x + z) n 

If r is a root of T(x + l) n , then x — 1/2 + z/2 = r yields a root of the right hand 
side. Thus x = r + 1/2 + z/2. 

□ 

Example 22.13. Suppose that T(x n ) =x n H n (l/x) = H^ ev (x). We have the fol- 
lowing identity that can be derived from the similar identity for the Hermite 
polynomial transformation. 

T(x + 2u-2) n =u n T(x n ) (-) (22.3.2) 



CHAPTER 22. TRANSFORMATIONS IN THE COMPLEX PLANE 



665 



Lemma 22.14. Suppose T(x n ) = x n H n (l/x) has roots ri,...,r n . The following 
definition 4>' makes J22.0.11 commute: 

b'-z^Yl (x-Tkd + z/2)) 

k 

Proof. We must show that T(x — z) n = T~[ k (x — r^{l + z/2) ). The conclusion 
follows as before. □ 



22.4 Lines, rays, and sectors 

We show that some transformations preserve lines or rays or sectors in the 
complex plane. 

Lemma 22.15. Suppose T : x n i— ► a n x n is a multiplier transformation that maps 
P v ° s to itself. For any cr e C, we haveT: P° R — ► P aR . 

Proof. Suppose f (x) = ni^* — ^0 where i\ e R, and let g(x) = ri(x ~ r 0- 
Then since T is a multiplier transformation 

Tf(x) = T(f(ox))(x/a)=T(a 11 -g(x))(x/a) = a u T(g(x))(x/a) 

Since T: P — >Pwe know that T(g(x)) G P, and so TF G F aR . □ 

Similarly, the roots of (x — i — a) n lie on the horizontal lines {xr^ + K}. 
Here is a very general way of finding linear transformations that preserve 
lines. 

Proposition 22.16. Suppose f 6 P. The linear transformation 
g i ► f [aD)g maps p^+ aR to itself for anyleM. and cr G C. 

Proof. Since f is a limit of polynomials in P, it suffices to prove the result for 
linear polynomials. So, choose g G p £ .+ <jR ; ft suffices to show that (crD + |3)g = 
crg' + |3gisalsoinP t+<TR forany Pel. Since ^g(x + y) = g'(x + y), we take 
£ = 0. 

If g G P s and we write g(x) = YIi( x - OTk) with n G R, then 



If we define h(x) = cr n g(crx) then h G P. Note also that h'(x) = cr n+1 g'(crx). 
When k(x) = crg'(x) + (3g(x) then 

k(crx) = crg'(crx) + |3g(crx) = cr n (h'(x) + (3h(x)). 

Since h G P, it follows that cr n k(crx) G P, and hence the roots of k(x) are real 
multiples of a. □ 



Corollary 22.17. Iff (x) G P a+xR then f (x + 1) + f (x - 1) G P 
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Proof. Simply note that 

f (x + 1) + f(x - 1) = (e D + e -D )f = 2cos(iD)f 

and cos(x) G P. □ 

Remark 22.18. This transformation appears in [26], where it is used to show 
that certain polynomials have roots whose real part is 1/2. 

In the the next two examples the regions are rays and horizontal lines. 

Lemma 22.19. Consider T: x k i— > Hk(x)x n ~ k acting on P[n). Every polynomial 
with roots on a ray in the complex plane based at 2 is mapped by T to a line through 
the origin with the same slope. 

Proof. Assume that f has all its roots on the line 2 + acr, where cr € C is fixed, 
and a > 0. If we define S(f ) = T(f (x — 2)) , then it suffices to determine the 
action of S on polynomials with roots on the ray through cr. Since <7~ n f(crx)) 
has all positive roots, the result follows from Lemma [9.601 □ 

Lemma 22.20. J/T(x k ) = H k (x) then P R+otl — ► p R +<"/ 2 . \ n other words, i/f (x) 
is a polynomial whose roots all have imaginary part <xx, then the roots of T(f ) all have 
imaginary part 

Proof. The addition formula for Hermite polynomials 117.8.41 implies that 

T(x n )(x + u) =T,(x + 2y) n 

and by linearity 

T(f}(x + y)=T«f(x + 2y) (22.4.1) 

Replacing x + y by x in this equation yields 

T(f) = (T*(f(x + 2y))(x-u) (22.4.2) 
Equation (|22.4.21 implies the following diagram commutes: 



XH^X+ OCX 



pK+OCl p 

T 

pR+ai/2 ^ p 

□ 

Multiplier transformations preserve sectors because rotation commutes 
with such transformations. 

Lemma 22.21. Suppose T: x n i— » a n x n maps P to itself and has all positive coeffi- 
cients. IfS is a sector of angle at most n then T maps P $ to itself. 
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Proof. Since T preserves interlacing and the sign of the leading coefficient it 
maps f + ig € 3 to Tf + iTg e 3. If |cr| = 1 is a rotation then T maps P~ aH to 
itself. Consequently. 

-p. p— crHn — H p— crH q p— H > p— crH p. p— H p— crHn — H 

We can choose cr so that —crH n — H is a sector of any desired opening of at 
most 7t. Another rotation to S finishes the proof. 

□ 

22.5 The set of roots 

In this section we look at properties of the set of roots. If we are in the fortunate 
situation of 1 122.0. 1|| then for any subset S of C we know that 

roots(T4>S) = roots ( cj) 'S) 

In general we can not describe the behavior so well. We next note that we 
can refine Lemma [6.21l as follows: 

Lemma 22.22. Suppose that!: p (a ' b > — ► p preserves interlacing. Then 

roote(T(<|>(a,b))) = roots(T(P [a - h) )). 

Proof. The proof follows from the proof of Lemma [6]21] and using the continu- 
ity of roots. □ 

If l n is the largest root of L n , then from Lemma |22.10| 

roots( T(cf)A) ) = |J{roots of T(x - z) n | |z| sC 1} 

consists of all points in the circle of radius i n centered at i n . See Figure 122.21 
which shows roots(T(4 ) (3A)) ) when n is 5 . Since i n goes to infinity as n goes 
to infinity, the union of all these circles covers the right half plane. This proves 

Proposition 22.23. IfT(x k ) = L k/ then{SR{Q ^ 0} C roots( T(cj)A) ). 

In other words, if 9t(£,) ^0 then there is a z and positive integer n where 
|z| ^ 1 such that £, is a root of T(x — z) n . 

22.6 Complex Transformations involving f (x + i) 

We introduce some transformations that involve complex numbers, and show 
that they preserve P because they are other known transformations in dis- 
guise. 

To begin, define the transformation T(f) = f(x + i) +f(x — i). If f has all real 
coefficients, then T(f ) has all real coefficients, since Tf is equal to its conjugate. 
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Figure 22.2: Image of the unit circle under Laguerre transformation 

Lemma 22.24. The transformation T: f (x) i— > f(x + i) + f(x — i) preserves roots. 

Proof. Since e aD = f(x + a) it follows that cos(D)f = =Tf. Since cos(x) is in P, 
cos(D) preserves roots, and thus so does T. □ 

Remark 22.25. More generally, if we use the fact that cos(ax + b) e P we see 
that 

cos(aD + b)f (x) = i(e lb f (x + la) + e~ lb f (x - ia)) 

is in P. E3 

We can iterate the construction of the following lemma. If we let the num- 
ber of terms go to zero then the sum becomes a convolution. See Il29l . 

Lemma 22.26. If a\, ... ,a r ,b\, ... , b r are real numbers, and f € P, then 

^ Y_ f (x ± aii ± a 2 i± ■ ■ ■ ± a r i)e l(±bl±b2 - ±b ^ e P 
where the sum is over all 2 T choices of± signs. 

Proof. This is cos(aiD + bi) • • • cos(a r D + b r )f. □ 
We get a similar result using sin instead of cos. The proof is omitted. 

Lemma 22.27. The map f (x) i— > f|x+l| 2i f( ' c ~ l1 maps P to itself. 
Next we look at a variation of the above. 

Lemma 22.28. If n is a positive integer and f (x) e P then 

f(x)+ Y_ f(x + lci) e? 
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Proof. We compute that 

n 

f(x) + Y_ f(* + ki) =2(l + cos(D)+cos(2D)+---+cos(nD))f(x) 

"sin((n + l)x/2) 



k=— n 



sin(x/2) 



cos(nx/2) 



(D)f(x) 



Now cos(nx/2) is in P, and so is the first factor, since we can write an infinite 
product expansion for it, and the factors on the bottom are canceled out by 
terms on the top. □ 



Corollary 22.29. J/f (x) e P then 



f (x + it) dt € P. 



-l 



Proof. Approximate the integral by the sum 



^ ff(x) + Y_ f(x + lk/rt)^ . 



2n 

This sum is in P, for we can scale f (x) in the Lemma. 

Here is an alternative proof not using sums. If T(f ) is the integral in the 
Corollary, then 



T(x n ) = 



(x + it) n dt = - 
-l i 



x + i 



n+l 



IX — I 



n+l 



n + l 



n+l 



= 2- 



e vD_ e -iD x n+l 



2i n + 
Thus T(f) = 2sin(D)Jf. This in in P since 
'sinfx 



- =2sin(D)Jx r 



and 



sin(x) 



srn(D)Jf = 



is in P. 



D))D(Jf) = (^^(DMf 



(22.6.1) 



□ 



Translating and scaling this result gives the following corollary, which will 
prove useful when we look at polynomials with complex coefficients (Chap- 
ter!^ 



f (x + oat) dt 



Corollary 22.30. If a > and f (x) e P then the all the roots of 
have imaginary part —a/2. 
Proof. Observe that 



f (x + ait) dt = - 
o 2 



f (f + ait/2) dt = - 
o 2 



f([x- ai/2] + ait/2) dt 



Since the last integral has all real roots, the roots of the first integral have 
imaginary part —a/2. □ 
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22.7 The Hermite-Biehler problem 

This section is concerned with determining how many roots of a polynomial 
with complex coefficients lie in the upper half plane. If a polynomial h has a 
roots in the Upper half plane, and |3 roots in the Lower half plane, then we 
define uX(h) = (a, (3). 

The Hermite-Biehler problem: Suppose that f, g have all real co- 
efficients and no common real roots. Express UX(f + ig) in terms 
of the arrangement of the roots of f and g. 

Suppose f and g are two polynomials with real coefficients, with perhaps 
some complex roots. The polynomial f + ig has the following fundamental 
property which follows from the fact that f and g have all real coefficients: 

If f + ig has a real root, then f and g have that root in common. 

Thus, if f and g have no real roots in common, then all the roots of f + ig are 
complex. Our approach is based on this simple consequence: 

If f t and g t are a continuous family of real polynomials where ft 
and g t have no common root for all ^ t ^ 1, then KX(fo + igo) = 
uX(fi + igi) since the roots can never cross the real axis. 

We begin by describing three transformations of f + ig that preserve the 
number of roots in each half plane. We assume that f and g have no common 
real roots. If ltC(f + tg) = (a, |3) thenKX(— f — ig) = (a, (3). Taking conjugates 
shows that 

uX(f - ig) = uX(-f + ig) = (|3, a) 

Consequently, we will always assume that f and g have positive leading coef- 
ficients. 

1. Join adjacent roots 

Suppose that u, v are adjacent roots of g that are not separated by a root 
of f, and let u t = tu + (1 — t)v. The transformation below doesn't ever 
create a g t with a root in common with f since u t lies between u and v. 
Thus UX(h) = UL(hi) since ho = h. 

ht = f + 1 — - — • (x — u t ) 

x — V 

2. Replace double roots with x 2 + 1 

Suppose that u is a double root of g. The following transformation 
moves a pair of roots of g into the complex plane, missing all real roots 
of f, and lands at ±i. Consequently, we get a factor of x 2 + 1. Again, 

UjC(h)=lLG(hi). 

H *= f + V^2 ((*-d-t)u) 2 +t) 
(x — UJ 2 v ' 
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3. Replace complex roots with x 2 + 1 

If a + bx is a root of g then since g has all real coefficients g has a factor 
of (x — a) 2 + b 2 . The transformation below converts this factor to x 2 + 1. 

h t = f + l 7 W~U ((x-(l-t)a) 2 + (l-t)b 2 + t) 

(x — a) 2 + b 2 v ' 

We can also do the above constructions to f. If we do them as many 
times as possible, then we have found a polynomial that we call the HB- 
simplification of h: 

H = (x 2 + l) T F + i(x 2 + l) s G 

The constructions above guarantee that ICC(H) = ICC(H). In addition, F 
and G interlace, since we have removed all consecutive roots belonging to 
only one of f, g. We can now state our result: 

Proposition 22.31. Suppose f and g are polynomials with real coefficients, positive 
leading coefficients, and no common real roots. Let H = (x 2 + l) r F + r(x 2 + l) s G be 
the HB-simplification ofh = f + ig, and set m = max(r, s). Then 



UC(H) 



(m,m) + (0,deg(F)) ifT < — G 
(m,m) + (deg(G),0) ifG < — F 



Proof. If F and G have the same degree then without loss of generality we may 
assume that F«G. Suppose that r > s. Consider 

ht = (x 2 + l) r f + i(tx 2 + l)(x 2 + l) s g 

Since deg(h t ) = deg(h), ICC (Hi) = ICC(H). We can do a similar transformation 
if r < s, so we may assume that both exponents are equal to m = max(r, s). 
If write (x 2 + l) m Hi = H then we see that UC(H) = ICC (Hi) + (m,m). Since 
Hi = F + iG, we can apply the classical Hermite-Biehler theorem to conclude 
that tCC(Hi) = (0, deg(F)). If F < G then we consider G e = eF + G and let e go 
to zero. The proposition is now proved. 

For completeness, here's a quick proof of the Hermite-Biehler theorem. 

Suppose H = f + rg with f«g, and the roots of f are V\, . . .,v n then there 
is a positive cr and positive ch so that 



g = of + Y_ Q i: 



A root p of f + ig satisfies 



Lai 
= l 
P-Vi 



Ip-vil 2 



= i 
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from which it follows that p has negative imaginary part. 

□ 

Example 22.32. Here is an example. Suppose that 

f = (x 2 + 2) (x 2 + 3) (x - 2) (x - 4)(x - 5) (x - 8)(x - 9) (x - 10) 
g = (x 2 + 5)(x - l)(x - 3)(x - 6)(x - 7) 

We write the roots of f and g in order, and successively remove pairs of adja- 
cent roots of f (or of g) 

gfgffggfff 

gfgggfff 

gfgfff 

gfgf 

Thus F< G. Since f has degree 10 and g has degree 6, we see that r = 
(10 - 2)/2 = 4, and s = (6 - 2)/2 = 2. Thus m = 4 and 

UX(f + ig) = (4,6) 

The image of the upper half plane H under a counterclockwise rotation of 
angle a is e la H. A polynomial f (x) has all its roots in this half plane if and 
only if f (e ai ) e P H , and this is the case exactly when 

$H(f(e ai ))<a(f(e ai )) 

and the leading coefficients have opposite signs. However, it is difficult in 
general to express the real and imaginary parts of f(e la ) in terms of coeffi- 
cients of f . When e ia is a root of unity then there is an explicit formula. We 
describe the general case, and then look at three special cases. 

Choose a positive integer n and let C, — e 2m/n . Write f = Y. a i xX an d 
define 

fkM = Y- QsXS 

s=k (mod n) 

We can express f (£x) in terms of the f k : 

n-1 n-1 

f(Cx) = ^fic(Cx) = ^C k fk(x) 

k=0 k=0 

If we define 

n-1 

f" (x) = Y_ cos(2k7t/n) $K(f k (x)) - sin(2k7t/n) 3(f k (x)) 

k=0 
n-1 

Fj(x) = ^sin(2k7t/n)fH(f k (x)) + cos(2k7t/n) 3(f k (x)) 

k=0 
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then since C = cos(2k7t/n) + isin(2k7r/n) 
f(Cx)=F (x) + ih(x) 

and Fo an d Fj have all real coefficients. Consequently, f e iff Fo < Fj and 
the leading coefficients have opposite signs. 

If f (x) has all real coefficients then the imaginary parts are zero and we 
have 

n-l 



Fo(x) = ^ cos(2k7t/n) f k (x) 



k=0 
n-l 

Fj(x) = ^ sin(2k7t/n) f k (x) 

k=0 

We now look at three special cases. 
Example 22.33. If n = 3 then £ = ^ + , and C 2 = - ^ We define 

FqM = SH(fo) + =^5R(fi) - ^a(fx) + i«H(f 2 ) - ^3(f 2 ) (22.7.1) 

Fi(x) = J(f„) + ^9l(fi) + ^3(fi) + ^9t(f 2 ) - ^(f 2 ) (22.7.2) 
If the coefficients of f are real then 

Fo(x)=f + ^fi + ^f 2 

PM \/3 , , V3 
h(x) = — fi + — f 2 

We conclude: 

Lemma 22.34. AZZ roofs o/f /za^e argument between 2n/3 and 5n/3 if and only if To 
and Fi m (|22. 7.1l l, l|22. 7.2b satisfy Fo < Fi and f/iez'r leading coefficients have opposite 
signs. Iff has all real coefficients then the interlacing criterion is 

2f - f i + f 2 < f i + f 2 . 

Example 22.35. We now take n = 4. Since e 2m/4 = i, if we follow the calcula- 
tion above, then 

T = W )-3(ii)-m(f 2 ) + 3(f 3 ) 
F 1 =a(f )+SR(f 1 )-J(f 2 )-5R(f 3 ) 

and consequently f has roots with only negative real parts if and only if 
Fo <Fj. This can be further simplified. If we assume that f(x) has all real 
coefficients then 
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If we write f (x) = p(x 2 ) +xq{x 2 ) then Fo(x) = p(— * 2 ) and Fj(x) = xq(— x 2 ). In 
this case, the result is also known as the Hermite-Biehler [81J theorem: 

Theorem 22.36 (Hermite-Biehler). Let f (x) = p(x 2 ) + xq(x 2 ) have all real coeffi- 
cients. The following are equivalent: 

1. All roots ofi have negative real fart. 

2. p(— x 2 ) and xq(— x 2 ) have simple interlacing roots, and the leading coefficients 
are the same sign. 

Example 22.37. We next take n = 8, where £ = We assume that all 

coefficients of f are real. Define 

FoM=fo + _|__|_ fl __| + i| (22 . 7 . 3 , 

FlW ^ + f2 + ^- f '-^ (22 - ? - 4) 

Lemma 22.38. If f (x) Zzas aZZ reai coefficients, then 9t(r) < 3(r) /or euery root r of 
f (x) if and onZy if Fo, Fi in (|22. 7.31 1, (|22. 7.41 /zaue leading coefficients of opposite signs, 
and satisfy F < Fj, 

Suppose we are given a line £ in the complex plane such that the angle 
the line makes with the real line is a rational multiple of 2n. We can find 
an interlacing restriction such that a polynomial has all its roots in a particular 
half plane determined by I if and only if the interlacing condition is met, along 
with agreement or disagreement of the signs of the leading coefficients. 

We can combine these interlacing and sign conditions, provided the rele- 
vant angles are rational. We can summarize this as 

Lemma 22.39. Suppose that S is a convex region determined by n lines, and each 
line makes a rational angle with the real axis. Then, we can find n interlacing and 
sign restrictions such that a polynomial has all its roots in S if and only if these n 
restrictions are satisfied. 

22.8 The Bocher-Walsh problem 

The essence of the Bocher-Walsh theorem can be stated 111551 in the following 
form 

If all roots of f lie in the upper half plane and all roots of g lie in 



the lower half plane then 



f f 
9 g' 



has no real zeros. 



This is reminiscent of the Gauss-Lucas theorem, which in general follows 
from the fact that if f has all its roots in the upper half plane then so does f '. 
Unlike the Hermite-Biehler theorem, it's easy to determine the location of the 
roots of the determinant. 
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f 


fi 


has | 


g 


9i 





Lemma 22.40. Suppose that f <^ fi has all roots in the upper half plane, and g ^ gi 
has all its roots in the lower half plane. If deg(f) = n > m = deg(g) then 

roots in the upper half plane 
roots in the lower half plane 

Proof. If the determinant is zero then f gi — gfi = 0, so fi/f = gi/g. Because of 
algebraic interlacing, we can find non-negative at, t>t so that 

* — X — Ti * — X — Si 

where roots(f) = (n) and roots(g) = (bt). If we evaluate the equation at any 
real number the left hand side is in the upper half plane, and the right hand 
side is in the lower half plane, so there are no real roots. 

Thus, as f and g vary among polynomials satisfying the hypothesis the 
roots of the determinant never cross the real axis. We may therefore choose 
f = (x — i) n , fi = f ', g = (x + \) m and gi = g', in which case 



f 


fi 




g 


9i 





[X -i) 
'x-i) 



n(x — i 
mix — i 



,n-l 
im— 1 



= IX — I 



,n-l, 



(x + t) ((n-m)x-l-(n-l-m)) 



This determinant has n — 1 roots in the upper half plane, and m in the lower 
half plane, so the lemma is proved. □ 

If n = m then the proof shows that there are n — 1 roots in each half plane. 

22.9 Newton's inequalities in the complex plane 

If we constrain the location of the roots of a polynomial to a region S of the 
complex plane, then geometric properties of S lead to restrictions on the New- 
ton quotients. It turns out that the inequalities only depend on the behavior 
of the Newton quotients on quadratics, so we define 



NQ(S) = inf 

fSP s (2) 



a Q2 



where f (x) = ao + aix + a2X 2 . Consider some examples. The usual argument 
shows that NQ(K+) = 4. Similarly, NQ(aX + ) = 4 for any cr e C \ 0. Since 
(x — cr)(x + cr) has Newton quotient zero, we see that NQ(crM) = for any 
cr e C \ 0. The Newton quotient of (x - cr) 2 is 4, and therefore NQ(§) s$ 4 for 
any S. 

Proposition 22.41. Suppose that § is a region ofC such that § and S _1 are convex. 
If f(x) = Y. a i x i zs i n P S i n ) t nen 



Qk-lGk+l 



NQ(S) /k + 1 n-k + 1 



n 



(22.9.1) 
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Proof. We follow the usual proof of Newton's inequalities. Write 



f(x)=5>m** 



k=0 



where (^bi = a|. The derivative of f satisfies 



f'(x)=n^b k (^_ 1 l )x 

k=l ^ 



k-1 



By Gauss-Lucas, the derivative is in P s since § is convex. If we form the re- 
verse of the polynomial, x n f (1/x) = Y. (?) bn-iX 1 - then the roots of the reverse 
are the inverses of the roots of f, and thus lie in S _1 . Since § _1 is convex, any 

c — 1 

derivatives of the reverse are in P . Consequently, if we apply the derivative 
k times, reverse, apply the derivative n — k — 2 times, and reverse again, the 
resulting polynomial is in P s : 



n • • • (n - k + 1) ^b k + b k+1 (J j x + b k+2 (^J x 2 ) e P s (22.9.2) 

The Newton quotient of this quadratic equation satisfies 

4 b 2 

k+1 ^NQ(S) 



bkbk+2 

Substituting b k = a k / (£) and simplifying yields the conclusion. 



□ 



In general it is hard to determine NQ(§). The next lemma gives a good 
lower bound for the Newton quotient that is often enough to determine 
NQ(S). 

Lemma 22.42. If < a < (3 < 2n and r, s > then the Newton quotient for 
(x - re ia ) (x - se lp ) is at least 4 cos 2 ( ^-). 

Proof. The Newton quotient is invariant under multiplication by any complex 
number, so we multiply by (l/r)e~ ,oc+|3)//2 . Thus we may assume that f = 
(x - e xy ) (x - te~ xy ) where y = [ — <x)/2. If t > then the Newton quotient is 



1 

= - + t + 2cos(2y) 



t 

This is minimized for t = 1, and has value 4 cos 2 [y ) ■ □ 
Sectors satisfy the conditions of the proposition. 
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Corollary 22.43. I/O < oc < |3 < n and S is the sector a < argz < (3 then 



at 



dk-lO-k+l 



> cos 2 ( 



a- (3 ^ + 1 n-k+1 
k n — k 



Proof. If u, v are any two points in the sector, then the difference of their ar- 
guments is at most (3 — a. From the lemma the Newton quotient is mini- 
mized when a and |3 are as different as possible, so the minimum is attained 
at(a-P)/2. □ 

Corollary 22.44. If f (x) = Y. a i xl zs a polynomial of degree n satisfying the condi- 
tion that the real parts of all the roots have the same sign, and the imaginary parts of 
the roots all have the same sign then 



at 



ak-iak+i 



k+1 n-k+1 



n 



Proof. The region § is a quadrant. Thus a — |3 = 7t/2, and 4 cos 2 ( ) = 2. 

□ 

Corollary 22.45. If f (x) = ^ qix 1 z's a polynomial of degree n w/zose roots a// feroe 
the same argument then 



ak-iak+i 



k + 1 



k+1 



n 



We can give a lower bound for more general regions. 

Corollary 22.46. If S and § _1 are convex regions, and if § is contained within a 
sector of angle width a then for f e P s 



at 



ak-iak+i 



^ cos 2 (a/2) 



k + 1 



n 



k + 1 



n — k 



For an example of such a region, consider a circle of radius 1 around cr 
where cr| > 1. Simple trigonometry shows that the cosine of one half the 
angle between the two tangents to the circle is ^|u| 2 — l/|cx|. We conclude that 
for all f with roots in this circle we have 



at 



Qk-iak+l 



k+1 n-k+1 



n 



If we restrict f to have all real coefficients then the results are simpler. 

Corollary 22.47. Suppose that § and S~ : are convex regions ofC. Ifi (x) = Y. a i x i 
is in P s (n) then 



ak-iak+i 



^ inf_cos (arg(z)) 

zeSns 



k+1 n-k+1 



n 



(22.9.3) 
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Corollary 22.48. I/O ^ a < n and § is one of the two sectors 

1. — cc ^ argz < a 

2. 7t — a ^ arg z ^ 7T + a 

and f is a polynomial with all real coefficients in P s (n) then 
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Equivariant Polynomials 



In this chapter we consider polynomials that are fixed under the actions of a 
cyclic group. 

23.1 Basic definitions and properties 

A polynomial is equivariant if it is fixed under the action of a group. For poly- 
nomials in one variable the group we consider is Z/nZ, the integers mod n, 
for which we take the multiplicative generator £ = e 2m ^ n . The action of a 
group element C k takes a polynomial f(z) to f(£ k z). Since Z/nZ is cyclic we 
only need to verify that f (z) = f (£z) in order to know that f is equivariant. 

When n is 1 then all polynomials are equivariant and we are left with the 
theory of polynomials in P and . In general, we want to consider equiv- 
ariant polynomials that have the largest possible number of real roots. Un- 
fortunately this is not quite the definition because we must treat zero roots 
specially: 

Definition 23.1. The set P/ Z n consists of all polynomials with real coeffi- 
cients fixed under the action of Z/nZ such that if we write f(z) = z s g(z) where 
g(0) ^ then g has degree nk and k real roots. We also define (where g is as 
in the preceding sentence) 

P + / Z n = {f e P/ Z n | all real roots of g are positive} 
P - / Z n = {f £ P/ Z n | all real roots of g are negative] 

We should make a few observations about this definition. First, if f (z) 
is an equivariant polynomial with a term cz k then f(z) = f(£z) implies that 
cz k = c(£z) k which requires that £ k = 1. Since C is a primitive n-th root of 
unity, the non-zero terms of an equivariant polynomial have degree divisible 
by n. This shows that s in the definition is also divisible by n. Moreover, if g 
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in the definition has a real root r then it also has roots r£, r£ 2 , . . . , rC n_1 . If n is 
odd then these are all complex. Thus when n is odd the number of non-zero 
real roots of any equivariant polynomial is at most the degree of g divided by 
n. If n is even then the number of positive roots is at most the degree divided 
by n. In addition, if n is even then g has positive and negative roots, so both 
P + / Z n and P - / Z n are empty. 

From the remarks above we see that if monic f G P/ Z n has real roots (or 
positive roots if n is even) T\,...,Ty, then we can write 

f (z) = z ns ]](z - r k ) (z - Cr k ) • • • (z - C'W) 

k 

= fV-) 

where 

f(x)=z s ;f[(x-r£) 

k 

This gives us a useful representation for polynomials in P/ Z n . A polyno- 
mial f (z) is in P/ Z n iff there is a polynomial f € P (or P alt if n even) such 
that f(z) = f(z n ). Since this correspondence is linear, we have a linear trans- 
formation 




n even 
n odd 



that is a bijection. 

Example 23.2. Consider the case n = 2. Since P/ Z 2 = {g(x 2 ) | g e P alt } if g 
has non-zero roots T\,..., r m then g(x 2 ) has non-zero roots ±y/r[ and hence 
P/ Z2 C P. The Hermite polynomials H n are invariant under Z 2 . Corol- 
lary [731] shows that the map x k 1— > Hk(x) defines a linear transformation 
P/ Z 2 — > P/ Z 2 . 

If n > 2 then P/ Z n is not contained in P. 

It is at first surprising that the derivative does not preserve P/ Z n : 

Lemma 23.3. Ifn is odd and f e P/ Z n then f £ P/ Z n &wf f (n) e P/ Z n . For 
any Owe have that z k f (k) (z) ts in P/ Z n . 

Proof. From the definition 

f(C(z + h))-f(Cz) , 

f (Cz) = hm — = C f (z) 

C^o Ch 

This shows that f (z) is not equivariant, but that zf'(z) is equivariant. Similarly 
f (k) (cz) = C~ k f (k ' (z) so that z k f (k) (z) is equivariant. It remains to show that 
f has the right number of real roots. 
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From Rolle's theorem, if a polynomial p has s real roots, then the derivative 
has at least s — 1 real roots, and they interlace the real roots of p. It might be 
the case that p' has more roots than p, but we always have that if p has a roots, 
and p' has b roots then a = b + 1 (mod 2). 

Without loss of generality we may assume that f has all distinct roots, all 
non-zero. Assume that n is odd. If f has degree nra then f and f have m non- 
zero real roots. The derivative f ' has either m — 1 or m — 2 non-zero real roots, 
and an n — 1 fold root at 0. The total number of roots of f has opposite parity 
from m, so f ' has exactly m— 1 non-zero roots. Since zf ' is equivariant, we can 
write zf'(z) = g(z n ) where geP and hence zf'(z) e P/ Z n . 

Next, f (2) has at least m — 2 non-zero roots, and an n — 2 fold root at 0. The 
same argument as above shows that z 2 f ,2) has an n fold root at 0, and exactly 
m — 1 non-zero roots. Consequently z 2 f ,2) is in P/ Z n . Continuing, f (l) has 
an n — i fold root at and exactly m — 1 non-zero roots. As above, z T f (l) is in 

P/ Zn. 

Since z n f (n) is in P/ Z n we can divide by z n and thus f ,n) has exactly m— 1 
non-zero roots, and is also in P/ Z n . □ 

Notice that although there is a bijection between P/ Z n and P some results 
are more natural in P/ Z n . For example, if f (x) = g(x 5 ) then f (5) is in P/ Z 5 . 
In terms of g this says that if g € P then 

120 g'(x) + 15000 xg"(x) + 45000 x 2 g (3) (x) + 25000 x 3 g (4) (x) + 3125 x 4 g (5) (x) 
is also in P. 

There is a general principle at work here: properties of P extend to P/ Z n , 
but we often have to replace x with z n , or f with f ,n) . 

Corollary 23.4. If n is odd and k is a positive integer then the linear transformation 
T: x j i ► (jn) k x j maps P to itself. In addition, x' also maps P to itself. If 

n is even the domain of the maps is P alt . 

Proof. The map T is the composition 

P ^ P/ Z n — > P/ Z n ^ P 
where * is the map g(z) g(z n ). To verify this claim, we calculate the image 
of a monomial z ] : 

z j i — z m i zkf(k> > z k (jn) k z in - k i ^ dj})^ 

To derive the second part, divide by the constant k!. □ 

23.2 Interlacing 

We make the usual definition of linear interlacing: 

Definition 23.5. Two polynomials f, g e P/ Z n interlace if and only if f + ag is 
in P/ Z n for all real a. 
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It is very easy to determine interlacing in P/ Z n . 

Lemma 23.6. Suppose f, g are in P/ Z n . Iff and g interlace then f and g interlace. 
The converse is true ifn is odd. 

Proof. Since the map f i— ► f is linear it follows that if f , g interlace then f , g 
interlace. Conversely if f and g interlace then f + ag is in P, and hence f + ag = 
(f+ag)(z n )isinP/Z n ifnisodd. □ 

If n is even then the converse is not true. For instance, if f = x — 1 and 
g = x — 2 then f (x 2 ) and g(x 2 ) do not interlace. However, the positive roots of 
f(x 2 ) and g(x 2 ) interlace. 

Corollary 23.7. I/tgeP/ Z n and f, g interlace then the degrees ofi and g differ 
by — n, 0, or n. 

Proof. The degrees of f and g differ by at most one. □ 

Corollary 23.8. Iff G P + / Z n then f <zf <z 2 f (2) < ■ ■ • . 

Proof. If we write z x f' x ' = gt(z n ) then the proof of Lemma 123.31 shows that 
9i ^ 9i+i an d the Corollary follows. □ 

Every linear transformation on P (or P alt is n even) induces a linear trans- 
formation on P/ Z n , and vice versa. So suppose that T: P — > P. We define a 
linear transformation T* by the composition 

P/ Z n P 

A 

T, I T 

P/ z n ^ p 

We can also use the diagram to define T if we are given T*, so linear trans- 
formations on P/ Z n are not very interesting. For instance, we have 

Lemma 23.9. If I, g G P/ Z n then the Hadamard product f * g is in P/ Z n . 

Proof. Observe that (f * g)(z) = (f * g) (z n ). □ 

23.3 Creating equivariant polynomials and the Hurwitz 
theorem 

Whenever a group acts on functions we can construct equivariant functions by 
averaging over the group. In our case, we define the average of a polynomial 
over Z/nZ to be 



1 

f(z) = -T_ nC k z) (23.3.1) 
n * — 



k=0 
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The average is obviously equivariant, and has a very simple interpretation 
in terms of coefficients. If f = ai<x k then 



1 / n_1 

k \j=0 



z k 



and since £ is a root of unity 



n-1 (- 
j=0 I 



kfn 
n kin 



This gives us our fundamental formula: 

f(z) = Y_ Qk * k (23.3.2) 

k=0 (mod n) 

Since the even part of a polynomial is just the average of f over Z/2Z the 
Hurwitz theorem Theorem |7.64| can be restated: 

The average over Z/2Z of a polynomial in P pos is in P alt . 

More generally we have 

Theorem 23.10. Ifn is odd and f e P alt then f e P + / Z n . 

Proof. A stronger result has been proved earlier. Here we give an alternate 
analytic proof of a part of Theorem l7.65l The key idea is to apply l|7.4.2) l since 
we can also express f as g(xD)f where 

. , 1 sin(7Tz) 



n sin(7tz/n) 



In order to see this, notice that the zeros of g are at the integers not divisible 
by n, and if k is an integer then g(kn) = 1. (If n is even then g(kn) = (— l) k .) 

If f e P alt (m) then g(z) has m - [m/n\ zeros in (0, m). By Corollary II .151 
it follows that g(xD)f has at least [ttv/tiJ zeros. However, if f = £ atx 1 then 

g(xD)f = Y_ gUHx 1 = Y_ = f 

Now f has degree n[m/nj and hence has at most [ra/TT-J real roots. Conse- 
quently f has exactly Lth/tlJ real roots, and is in P + / Z n . □ 

Corollary 23.11. Iff = £ a k x k is in P alt (or P alt ) then £ a nk x k is in P alt (or 

Corollary 23.12. Ifn is odd then ^™ =0 is in P alt . 
Corollary 23.13. Iff<ginP then fig. 

Proof. The average is a linear transformation P alt — ► P/ Z n . □ 
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23.4 Hermite polynomials 

The Hermite polynomials have analogs in P/ Z n . If we replace x 2 by x n in 
||7.8. Il l we get polynomials H£ that are nearly equivariant. 



which shows that the degree of H£ is k(n — 1). If we write z k H£(z) = g£(z n ) 
then the degree of g£ is k and 



From the recurrence we see that H£ satisfies H£(Cz) = C k H£(z). The 
recurrence also shows that all g£ are in P alt , and hence 

Lemma 23.14. The polynomials z k H£ are in P + / Z n (or P / Z n if n is even). In 
particular, H£ k (z) is in P/ Z n . 

We can also express H£ as a composition: 




From the definition we find the recurrence 



g£ +1 (x] = (nx + k)g£(x)-nxg'(x) 



K(z) = (nz 



D) k (l 



Chapter 

24 



Polynomials modulo an ideal 



If § is a set of polynomials in variables x, and 3 is an ideal in M[x] (or C[x]) 
then in this chapter we study S/j. There are two cases that we are particularly 
interested in: 

1. P/{x n } 

2. u 2 /{y n } 

We want to find intrinsic descriptions of ^3 as well as to resolve the usual 
questions of interlacing, coefficient inequalities, and linear transformations. 

24.1 Initial segments of P 

An element of P/{x n } can be viewed as a polynomial f = cux 1 such that 

there is an extension to P. By this we mean a polynomial f in P such that the 
first n coefficients of f and f are equal. For instance, 1 + 7x + 28x 2 e P/{x 3 } 
since there is a polynomial, namely (1 + x) 7 , that begins 1 + 7x + 28x 2 + • • • . 
We sometimes say that elements of P/{x n+1 } are n-initial segments of P. Note 
that P(n — 1) C P/{x n } since f € P(n — 1) is its own extension. 

Although P/{x 2 } = {a + bx a, b 6 M} is trivial, even the quadratic case for 
P pos takes some effort. 

Lemma 24.1. P pos /{x 3 } = {a + bx + cx 2 | a,b,c>0 & b 2 > 2ac} 
Proof. If f = Y_ a i x ^ S P pos (n) then Newton's inequalities yield 



Qo Q2 1 n — 1 

Conversely, assume that b 2 > 2ac. If we can find f = a + bx + cx 2 + • • • e pP° s 
then if we divide by a, and replace x by xb/a we can find 1 + x + Cx 2 + • • • 6 P 
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where < C < 1/2. It therefore suffices to show that if < C < 1/2 then we 
can find f = l + x+Cx 2 + -- -e P pos . 
The quadratic equation shows that 

1 + x + tx 2 G P for all < t < 1/4 

Squaring yields that 

(1 + x + tx 2 ) 2 = 1 + 2x + (2t + l)x 2 + • • • G P for all < t < 1/4 
and replacing 2x by x yields 

1 + x + (1/4 + t/2)x 2 + • • ■ G P for all < t < 1/4 
If we let s = 1/4 + 1/2 we have 

1 + x + sx 2 H G P for all < s < 3/8 

Continuing by induction, if we expand (1 + x + tx 2 ) 2 ", we get that 

1 + x + sx 2 H G P for all < s < 1/2 - 2~ n 

Consequently, if we choose n so that 2~ n < 1/2 — C then we can realize 1 + 
x + Cx 2 as a 2-initial segment. □ 

The argument in the lemma shows that powers of 1 + x + ax 2 converge, 
after renormalization, to 1 + x + x 2 /2. Here's a more general result that shows 
why the exponential function appears. 

Lemma 24.2. Iff = 1 + a\% H h a n _ix n_1 + x n then 

lim f(x/n) n = e QlX 

n— »oo 

Proof. If we write f = Yl( x + r then Yl r i = 1 an d Y. V r i = a i- 
lim f(x/m) m = lim FTf*/™ + n) m 

m — >oo x — s-oo 

n / \ m n 

=n Hm — +i ) =n 



1 



□ 



Not only do the powers converge to an exponential function, but the coef- 
ficients are bounded by the coefficients of e x . In addition, Lemma [24. 2l shows 
that there are polynomials in P pos /{x n+1 } whose coefficients are arbitrarily 
close to 1/i!. 

Lemma 24.3. J/1 + x + a 2 x H h a n x n G P pos then < cu < 1/i!. 
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Proof. Newton's inequalities give that 1/2 > a 2 . Assuming that the Lemma is 
true up to n — 1, we see that 

a n _ 1 > — — a n _ 2 a n Newton 

1 n-1 1 j 
Qn < 7 induction 

(n-1)! 2 n a n _ 2 

a n < 1/n! induction 

□ 

If f = b + b x x + • • • e P pos /{x n+1 } then 

1 ,b X N 

T-f hr- ) =l+x + --- 
b bi 

is also in P pos /{x n+1 } Thus, in order to describe P pos /{x n+1 } it suffices to de- 
termine which polynomials 1 + x + a 2 x 2 + • • • + a n x n are in P pos /{x n+1 }. The 
lemma shows that the set of all such (a 2/ • • • , a n ) is bounded. 

If 1 + x + a 2 x 2 + a 3 x 3 € P pos /{x 4 } then we know several bounds: 

1/2 > a 2 > 
1/6 > a 3 > 
a 2 > (3/2) a 3 Newton 
1 + a 3 ^ 2a 2 J24.3.H 

Empirical evidence suggests that these are also sufficient. 
24.2 Initial segments of P2 

We now introduce P po y{u n+1 }. Its elements can be considered to be sums 

fo(x) H + f n (x)u T1 arising from a polynomial in P 2 . Here are two examples 

of elements in P 2 /{y n }. 

• Let f € pv° s . The Taylor series for f (x + y) shows 

f f" f(Tl) 

f+v i! +y2 2! + '" +yn ^r GP2//{l 1 1 

• If we choose f = 0( x + Q i + biTj) and set g = f|(x + at) then 

g+u ^bj— g— +y 2 ^b;b k ■— g— — GP 2 /{y 3 } 

^— X+dj f— X + dj x+a k 

The case n = 2 is very familiar. 

p 2 /{y 2 } = {f + yg I f <ginP} 
Ff s /{u 2 } = {f+-yg I f < g mP pos } 
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We think that the characterization for Pf^Ay 3 } is similar to the result for 
P pos /{x 3 }. Namely, f +y fi+y 2 f2 is a 2-initial segment if and only if f < fi < ^ 
and f j — 2f of2 ^ for positive x. Necessity is easy since we just substitute for 
a and apply the result for P pos /{x}. The problem is existence. 

24.3 Coefficient inequalities 

There are Newton's inequalities for P pos /{x n }. If we choose f = YJa 1 a i x% m 
P pos / {x n } and f has an extension of degree r then Newton's inequalities tell us 
that 



Qkak+2 k r - k 
Since we don't know r all we can say is that 



it+i k+1 



QkUk+2 



In addition to these Newton inequalities, we have determinant inequalities 
for P pos /{x n }. Any subdeterminant of l|4.5.2|l that only involves ao up to a n _i 
must be non-negative. For instance, if ao + Qix + a 2 x 2 + a 3 x 3 G P pos /{x 4 } then 



aj a 2 a 3 
a ai a 2 
a ai 



> 



which gives the inequality 



a 3 + Qga3 > 2aoaiQ2 



(24.3.1) 



If we are given q H h a n x n e P pos /{x n+1 } then there is an f e P pos with 

f = Y_ ck*i/ but we don't know the degree m of f . We can also use Newton's 
inequalities to get lower bounds on m. First of all, 



aic-idk+i k 
Solving for m yields 



m — k 



for 1 < k < n 



m > k + 



m - 



k + 1 



m Vak-jQfc+i k 
These inequalities hold for 1 ^ k < n. For example, if k = 1 the inequality is 



m r> 1 +2 



Q Q2 



-1 



(24.3.2) 
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We see m increases as the Newton quotient gets closer to its minimum value. 
It's not surprising that the exponential series is the unique series where the 
Newton quotient equals its lower bound for all indices. 

For an example, we claim that the smallest extension of 1 + ux + Q)* 2 has 
degree n. Indeed, if we compute the bound (|24.3.2|l we get exactly n. 

Remark 24.4. P/{x n } isn't closed. If we have a sequence fk € P/jx* 1 } such that 
f Tc — * f then the extensions might not converge to a polynomial. For instance, 
424.3.21 shows that 1 + x + x 2 /2 £ P/{x 3 }. It is, however, the limit of (1 + x/n) n 
in P/{x 3 }. 

24.4 There are many extensions 

We show that polynomials in P pos and have extensions of arbitrary de- 
gree. 

Lemma 24.5. If f € P pos /{x n } has an extension of degree m, then f has extensions 
of degree m + k/br k = 1, 2, 

Proof. We observed earlier Cp. 1128b that if f e P(n) then there is a positive oc 
such that f + ax n+1 £ P. Using this fact, we establish the lemma by induction. 

□ 

Lemma 24.6. If f S F^/ hj n } has an extension of degree m, then f has extensions of 
degree m + kfor k = 1, 2, 

Proof. If f G Pj 055 (n), and g (x, y ) consists of the terms of degree n then we will 
show that there is an e such that 

Fe(x,u] = H*,y) + e(* + u)g(x,y) 

is in P^ os (n + 1). Since the asymptotes of the graph of f are also asymptotes of 
the graph of F e for any e, we see that there is an N such that if |a| > N then 
F e (a,u) € P. Also, for any a we can, by the observation in the proof above, 
find an e such that F e (a,u) G P. Since [— N,N] is compact, we can find an e 
that works for all points in this interval, and hence it works for all of K. □ 

A polynomial in P(n) determines an element in P/{x r } for 1 ^ r < n. 
However, it is not true that an element of P determines a member of P/{x r }. 
For instance, e x S P, but it is known [187[ that no k-initial segment is in P 
except for k = 1. Corollary 15.201 is an example of the other extreme. The 
polynomial 

oo 

^x k 2- k2 
1 

has the property that all initial segments are in P pos . 
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24.5 Interlacing 

We can define interlacing as usual. We say that f and g interlace, written f ~ g, 
in P/3 if f + cxg € F/3 for all a e K. If we restrict a to positive values we write 

f ~ g. 

Here is a simple result. 

Lemma 24.7. Suppose a + bx + cx 2 and A + Bx + Cx 2 are in P pos /{x 3 }. Then, 
a + bx + cx 2 i A + Bx + Cx 2 if and only z/bB ^ aC + cA. 

Proof, a + bx + cx 2 + a(A + Bx + Cx 2 ) is in P pos /{x 3 } if and only if 

(b + aB) 2 >2(a+aA)(c + aC) 

and this holds for all positive a if and only if bB ^ aC + Ac holds. □ 

It is not easy to show that two polynomials do or don't interlace. For ex- 
ample we show that x and (x + l) 2 do not interlace in P/{x 3 }. If they did then 
(x + l) 2 + ax e P/{x 3 } for all a. Newton's inequality says that 

(2 + a) 2 ^ 2 

and this doesn't hold for all a, so they don't interlace. It is the case that for 
any positive t 

x ± ( x + t) 2 in P/{x 3 } 

since (2t + a) 2 > 2t 2 for a ^ 0. 

Since there is no concept of degree in P/{x n+1 } there is no idea of the direc- 
tion of interlacing, so we can not expect simple inequalities between interlac- 
ing polynomials. We do have 

Lemma 24.8. Suppose £j aiX 1 and Y_ biX 1 are interlacing polynomials in P/{x n+1 }. 
The following inequality holds for < k < n — 1: 

_ 2 

(k + 2)a k b k+2 + (k + 2)a k+2 b k - 2(k + l))a k+1 b k+1 ' 

4(k+ 1) 2 ( a{ +1 - ^4a k a k+2 ] (b k+1 - ^4b k b k+2 



k+1 T V V k+1 

Proof. Apply Newton's inequality to ^ ( a k + ab k )x k . The result is a quadratic 
in a; the inequality is the statement that the discriminant is non-positive. □ 



24.6 Linear transformations 



A linear transformation P — > P does not in general determine a linear trans- 
formation P/{x n } — > P/{x n } since T applied to different extensions can give 
different results. We will see that Hadamard products do preserve P/{x n }, and 
there are some other interesting preserving transformations arising from P^ os> . 
We recall an elementary result about ideals. 
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Lemma 24.9. If 3,3 are ideals o/R[x] then a linear transformation T on R[x] such 
that T(U) c 3 determines a map T* : P/3 — ► P/d- 

Proof. The point is that the map involves a choice, and the hypothesis makes 
the map well defined. We define T* by 

T „ 



P 

A 

choose extension 

P/J 



project 



■P/3 



□ 



For example, the derivative map D : f i-» f ' maps the ideal generated by 
x n to the ideal generated by x n_1 . We therefore have 

D: P/{x n } — > P/lx^ 1 } 

More generally, we have 

Lemma 24.10. Iff e P(r) f/ien f (D) : P/{x n } — ► P/{x n - T }. 

P is closed under products, and 3 x 3 c 3 so the product induces a linear 
transformation 

p/j x p/a — ► p/a 

If f = a i xl an d 9 = ^ biX 1 then the matrix representation of 
P/{x n } x P/{x n } — » P/{x n } 

is given by 



/ b \ 
bi 



/ a 



a 



\ / b \ 
bi 



\t>n-l/ \a n _i Q k -2 •■• do J \b n -lj 

We get more interesting matrices if we use two variables. Consider 



K os x p 



multiplication — pos extract coefficient _ 
1 o > 1 



1 2 a i f i 2 

If we take the coefficient of y n then we get a map 

5P os ^ lJ/r^n+1-i multiplication — p0 s ; rn+li coefficient of y n p /r n+li 



F P 2 ° b X F/{x n+1 } 



pT/{x n+1 } - 



P/{x n+1 } 



For example, take g(y) — Y. bit) 1 € P and f (x,y) = Y. aijX 1 ^' € P2. Then 
f g e P2, and the coefficient of y™, is in P. The coefficient of x'u n is ^ biQ j/n _i. 
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This gives a matrix that preserves P/{x n+1 }. For instance, if we consider 2- 
initial segments and take n = 2 then the mapping 

U02 QQl lOO 
Q 12 an Qio 
Q22 Q21 O-20 

maps P/{x 3 } to itself. Similarly, we can find matrices that preserve P2/{y n }- If 
we take f(x,t),z) e P3 and g(x, z) e P2 then 

Theorem 24.11. Suppose f e P3 z^/zere f = ^ fy (x) t) l z' . if we define 

t/ze« M maps P?j{y n } to itself. 

Consider an example. Choose g e pP° s / and consider g(x + y + z) in 
Theorem l24.11l The matrix M is 



/g (2) 

2!0! 


g' 

1!0! 


g \ 

0!0! 


g(3) 

2!1! 


g(2) 

1!1! 


g' 

0!1! 


g(4) 

\2!2! 


g(3) 

1!2! 


g(2) 

0!2!/ 



We can use the theorem to find matrices M and vectors v such that M k v 
consists of interlacing polynomials for k = 1, 2, For example, take 

f = (x + y + 1) (x + 2y + 2) (2x + y + 1) 

= (2x 3 + 7x 2 + 7x + 2) + (7x 2 + 14x + 6)y + (7x + 6)y 2 + 2y 3 
v = (2x 3 + 7x 2 + 7x + 2, 7x 2 + 14x + 6, 7x + 6, 2) 
g = (x + u + z + l) 2 (x + 2y + 3z + 4) 2 

/ 6(4x + 7) 22x 2 + 86x + 73 2(x + l)(x + 4)(4x + 7) (x + l) 2 (x + 4) 2 



M = 



30 


V 



2(29x + 53) 
37 




2 (I7x 2 + 70x + 62) 
2(23x + 44) 
20 



6(x + l)(x + 2)(x + 4) 
13x 2 + 56x + 52 
12(x + 2) 



Since M k v e Pi/{y 4 } for k = 1, . . ., it follows that all vectors M k v consist of 
interlacing polynomials. If we take g = (x + y+ z+ l) 6 then the matrix M is 
composed of multinomial coefficients: 



/ (£)(*+ 1) 3 
(3,2) (x+D 



(2,0) (* + D 4 
GJ) (x+i) 3 

(2,2) 
( 2 y 



d 6 o) (* + D 5 
(u) (* + D 4 
(u) U + D 3 
(u) + 
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The Hadamard product defines a map P pos x P — ► P. Since f x • maps 
{x 11 } to itself, we have a map 

P pos /{x n } x P/{x n } — ► P/{x n } 

The converse is slightly more complicated. 

Lemma 24.12. Suppose that the Hadamard product with f maps the closure of 
P pos /{x n } to itself. Then exp(f ) € P v ° s /{x n }. 

Proof. e x is in the closure of P pos /{x n }. □ 

Here is an interesting construction of matrices that preserve interlacing in 
P pos . Recall CP(M) is the characteristic polynomial of M, and M[l,2] is M 
with the first two rows and columns removed. 

Lemma 24.13. If M is positive definite then the matrix below preserves interlacing 
i n pvos 

CP(M[1]) CP(M) \ 
CP(M[1,2]) CP(M[2])i 



Proof. The determinant of 

-xI + M 



'no.. ' 

z ... 

.:: 



equals 

CP(M) +y CP(M[l])+z CP(M[2]) +yz cp(M[1,2]) 
Now apply Theorem l24.11l □ 

24.7 Stable polynomials 

Some stable cases are easy. 
Lemma 24.14. 

2. Jf^/{x 2 } = {do + ctix + 0.2X 2 I ao, ai, a2 > } 

2. l K^{x i } — {ao + aix + Q2X 2 + CI3 x 3 | ao, a\, CL2, 0.3 > & aia2 > aoa3 } 

3. J{/{x 5 }D{a H h a4X 4 | a , ...,a 4 ^0 & aia 2 a 3 ^ a 2 a4 + a^ao } 

Proof. Lemma 121.381 shows that if the inequalities are strict then the polyno- 
mials are already in "K\ . The quadratic is trivial, and we know from 021 . 7.11 1 
that aici2 > aoa3 characterizes cubic stable polynomials. If f G 3i] then by 
Lemma |21.38| we know that its 4-initial segment is in J£ \ . □ 
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Example 24.15. We conjecture that 

M/{ij 2 } = {f<g|f,ge^i} 

The left hand side is certainly contained in the right hand side; we give an 
example to show that certain elements of the right hand side are in the left. 

We know that (x + l) 2 <.x + a in "K\ if and only if < a < 2. Choose 
positive a and consider the matrix 

(\ 0\ , fei 0\ , fO -a 



x 



1 7 y V e 2 J \oc 



1 + a 2 - aVl + a 2 

1 + a 2 + a-v/T 



The determinant of M is 



(l + a 2 )(l+x) 2 +u (ei + e 2 )+x^(l + a 2 )(ei + e 2 ) + (ei-e 2 )ccVl + a 2 

Thus, the first term is a constant multiple of (x + l) 2 , so we need to determine 
what the coefficient of y is. If we divide the coefficient of x in this coefficient 
by the constant term e\ + e 2 we get 



ei - e 2 

1 

ei + e 2 



or j — aV 1 + a 2 



Now as a, ei, e 2 range over the positive reals, the expression above takes all 
values in (1/2, oo), and therefore all linear terms x + a where < a < 2 are 
realized by a determinant. Thus, (x + l) 2 + y(x + a) lies in "K-J {y 2 }. 

The next result is the 2x2 analog of Theorem 124.111 the general case is 
similar. 

Lemma 24.16. Suppose h(x,y) — Y_ h-i{x)y x is in J{ 2 . For appropriate n the matrix 
( H C 1 Hnli ) ma P s xj{y 2 } to itself. 

Proof. We can write F = f + gy + • • • where F e J{ 2 . The coefficients of y n 
and u n_1 in hF are h n f + h n _ig and h n _if + H n _ 2 g. The result follows since 
adjacent coefficients interlace in J{ 2 . □ 

24.8 Other ideals 

A representative element of P2° s /{x 2 ,u 2 } is a polynomial a + bx + cy + dxy. 
The characterization is very simple. 

Lemma24.17. P^ os /{x 2 ,u 2 } = {a + bx + cy + dxy | a,b, c, d > & | £ \ \ < 0} 
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Proof. If f = a + bx + cy + dxy + ■ ■ ■ is in P 2 then Theorer ri9.113l says that 

I c 5 I < 0- Conversely, assume the determinant is negative. By dividing by 
a and scaling x and y it suffices to show that 1 + x + y + Dxy S P^/lx 2 , y 2 } 
where < D < 1. Consider the product 

(l + ax+|3y)(l+ (l-a)x+ (1 - P)v) 

= 1 + x + y + (a(l - (3) + 6(1 - a))xy + • • • 

Clearly since < D < 1 wecanfind ) < a, 6 < 1 so that (ct(l — 6) + 6(l — a)) = 
D. □ 

Although 2xy — 1 ^ P 2; the product 

(x + y + l)(x + y-l) = -l + 2xy + --- 

shows that 2xy — 1 € P2/{x 2 ,y 2 }. A similar argument shows that 

Lemma 24.18. P 2 /{x 2 ,y 2 } = {a + bx + cy + dxy | | *\ | ^ 0} 

The elements of P^ os / {y 2 , z 2 } are interlacing squares with non-negative de- 
terminant 

foo < fio 
|A |A 
foi < f 11 

arising from Y_ f vj (x)y x z' € Pj 08 . See the next section. 
Lemma 24.19. Iff < g,h then f + yg + zh e P 3 /{y 2 ,yz,z 2 }. 
Proof. The argument similar to Lemma [9. 1031 If we write 

g = ^aif/(l + Tix) 

h = ^bt f/(i + nx) 

then 

-1— r f 

TT (1 + nx + cuy + bt z) = f + gy + hz+ V" a^bj — — — -yz H 

II ^7 '(l + nxJfl+TjX) 

□ 

Lemma 24.20. P 2 /{x — y}«P 

Proof. A representative of P2/{x — y} is a polynomial of the form f (x, x) where 
f( x /U) € p 2- Now we know that f(x,x) € P so this gives a map Pi/i* — 
y} — ► P. Conversely, if g € P has positive leading coefficients and we define 
f(x,y) = g((x + y)/2) thenf(x,y) € P 2 andf(x,x) = g. □ 
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If we make a divisibility assumption then we get induced maps. 
Lemma 24.21. Suppose that the linear transformation T satisfies 

1. T(x n ) divides T{x n+1 ) for n > 1. 

2. TiP^^P* 

3. T- 1 : P s — > P s 

T : P7{x n } — ► P7{T(x n )} 
T- 1 : P s /{T(x n )} — ► P s /{x n } 

Proof. If g = Y. Q i. xl is an y polynomial then we claim that there exist ctt € K 
such that 

T(gx n ) G {T(x n )} (24.8.1) 
g T(x n ) = Y_ aiT(x n+i ) (24.8.2) 

For (124.8.1b we have 

J( x n+i) 



T(gx n ) =T(x n )^Qi- 



T(x r 



which is of the form h(x)T(x n ) by the first hypothesis. For 1)24. 8.2b , using 
induction it suffices to show that it is true for g = x. If we write T(x n+1 ) = 
(<x+ |3x)T(x n ) then 

X T(x-) = iT(x- +1 )-|T(x-) 

If fo G V^/{x n } then write f = f + g x n where f 6P S . Applying T 

T(f )=T(f) + T(gx Tl ) 

and using the facts that T(f) G P K and T(gx n ) G {T(x n )} shows that T(f ) is 
in P7{T(x n )}. If f G P s /{T(x n )} then write f = f + gT(x n ) where f G P s . 
Applying T _1 yields 

T- 1 (fo)=T- 1 (f) + T- 1 (gT(x 11 -)) 

Since T~ 1 (gT(x n )) = x n £ o^x 1 and T -1 (f) G P s we see that T -1 (f ) is in 
P s /{x n }. □ 

Corollary 24.22. LetT(x n ) = (x) n . Then 

T : P alt /{x n } — ► P a 7{T(x n )} 

j-i . pp°y{T( x n )} — > p pos /{x n } 
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Corollary 24.23. If a + bx + cx 2 e P pos /{x(x - l)(x - 2)} then (b + c) 2 > 2ac. 
Proof. If a + bx + cx 2 e P pos /{x(x - 1) (x - 2)} then and T(x n ) = (x) n then 

T^fa + bx + cx 2 ) = a + bx + c(x 2 +x) € P pos /{x 3 } 
This implies that (b + c) 2 > 2ac. □ 

24.9 Coefficients of extensions 

In this section we choose f € S/3 and a monomial x 1 , and consider properties 
of the set 

| oc | a, is the coefficient of x 1 in F e § where F = f (mod 3) j 

Our first elementary example is a linear polynomial. We take § = P pos , 
3 = {x 3 }, x 1 = x 3 , and f = a + bx. 

Lemma 24.24. If a, b > then 

| a | a + bx + ax 2 e P pos /{x 3 } j = ( 0, b 2 /(2a) ) 

Proof. Use the fact that a + bx + ax 2 e P pos /{x 3 } if and only if b 2 > 2 aoc. □ 

The corresponding problem for P pos is unsolved. We believe that if f +yg G 
P2° s /{u 2 } (equivalently f < g) then 

h|f + ug + hu 2 eP P 7{u 3 }| = |h|g<h & |U|< } 

The conjectured solution for P pos is similar. If f + gy + hz e P3° s /{u 2 , z 2 } 
then 

jklf + ug + hz + kuzeP^yiu^z 2 }! = |k|g,h<k & |^|<o| (24.9.1) 

We don't know how to approach these problems, but we can establish 
properties of the interlacing squares of the last question. Using these, we 
can establish l|24.9.1t when the degree of f is two. So, we consider the new 
question 

What are the extensions of three interlacing polynomials to an in- 
terlacing square with a positive determinant? 

We are given three polynomials f, g, h in P pos with positive leading coeffi- 
cients that satisfy f < g, f < h, and consider the set X of all polynomials k with 
positive leading coefficient satisfying 
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f < 9 (24.9.2) 

A A 
h < k 

>0 for x e R (24.9.3) 

First of all, 

Lemma 24.25. X is a non-empty bounded convex set of polynomials. 

Proof. Lemma 124.191 shows % is non-empty. Suppose that ki and k2 satisfy 
1 124.9. 2\ and (|24.9.3l l. If < a < 1 then a and 1 — a are positive, so 

f < 9 

A A 
h < ak! + (l-oc)k 2 
From the determinant hypothesis: 

gh - fki > 
gh - fk 2 > 0. 

Multiplying and adding yields 

gh-f(aki + (l-a)k 2 ) >0 



Thus aki + (1 — a)k 2 satisfies the two conditions, and is in %. 

Since f(l) > we see that g(l)h(l)/f(l) > k(l). Consequently, all coeffi- 
cients of k are bounded by g(l)h(l)/f (1). □ 

Example 24.26. If f = (x + l) n , g = h = (x + l) n ~ l then the interlacing re- 
quirement implies that k = oc(x + l) n ~ 2 . The determinant requirement implies 
that < a < 1, and so 

3CC {ct(x + l) n - 2 < cc < 1} 

In order to see that this is an equality we proceed as follows. If q^, bi are non- 
negative then 0^(2^ Qibt) ^ (£j a t ) (T_ bj). So, if < a ^ 1 choose at, b t so 
that Y_ Qi = Y. bi — 1 an d Y. a i^i = 1 — a. We have 

n 

ntx + l + CHU+btz) 
1 

= (x + l) n + y (x + l) n - x + z (x + l) n - a + uz(x + l) n - 2 ^ Q i b i 
This is the desired representation since 

Y a i°j = ( Y- ai ) ( Y- b i ) - X. aibi = i - (i - «) = «• 

i^j V / V / 
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Example 24.27. If F(x,u, z) £ Pj 0S and F = Y_ fij Mv 1 ^ then fn is an extension 
of foo/fiO/foi- Thus, the polynomial fn e 9C. In particular, if f(x) e pp*", then 
f(x + y +z) € P^ os , sof"(x)/2 is an extension of f,f',f. 

In the case that f has degree 2, % is an interval of positive numbers. We 
may assume that the polynomials are as given below, where all constants are 
positive: 

(x + n)(x + r 2 ) < ti(x + Ti)+t 2 (x + r 2 ) (24.9.4) 

A A 
si(x + ri) +s 2 (x + r 2 ) < k 

The construction of the lemma suggests we consider the polynomial 

(x + rj + siy + tjz) (x + r 2 + s 2 u + t 2 z) 

It provides an extension where k is sjt 2 + s 2 ti. Note that this value does not 
depend on the roots of f . Using these polynomials, we have 

Lemma 24.28. The set % of extensions of ( |24.9.4| | z's the interval 

(sit 2 + s 2 ti - 2 v / sis 2 tit 2 , sit 2 + s 2 ti + 2 v / sis 2 t 1 t 2 ). 

Proof. The polynomial gh — kf is a polynomial in x, and for it to be positive 
its discriminant must be negative. The discriminant is a quadratic in k whose 
roots are sit 2 + s 2 ti ± 2 v / sis 2 tit 2 . □ 

An explanation of these conditions is that when n = 2 (|24.9.2b and 424.9.31 
are exactly what we need to extend to a polynomial in Pj 0S . 

Lemma 24.29. Assume that deg(f) = 2. The polynomials f, g,h,k in p v ° s satisfy 
the conditions (|24.9.2b and d24.9.3b above if and only if 

f + gy+hz + uzke F p 3 0S /{y 2 ,z 2 } 

Equivalently, if f + gy + hz e F 3 ° s /{y 2 , z 2 } and f has degree 2 then ( 124.9.1b holds. 

Proof. We only need to show that if we are given f , g, h, k satisfying the condi- 
tions then there is such a polynomial in P3. We start with the matrix 

M _ f{r\ + x + s 2 y +t 2 z)/ri d 2 y + e 2 z \ 

~V d 2 u + e 2 z (r 2 + x+ siy +tiz)/r 2 y 

and notice that rir 2 |M| is in P3 and equals 

f + gy + hz+ (-2d 2 e 2 rir 2 + s 2 ti + sit 2 )uzH 

From the Lemma above we know that we can write 



k = sit 2 + s 2 tj + 2(3 where |0| < v / sis 2 tit 2 
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Since the matrix M is of the form I + xDi + UD2 + ZD3 we know that we 
must choose d.2, e-i so that Di, D2, D3 are positive definite matrices. Now D2 — 

( S2 d 2 Tl S1/V2 ) anc ^ ^ 3 = ( Wtj ) ' me determinant must be positive d2 and 
e2 must satisfy 

sis 2 /(r 1 r 2 ) > &\ 
tit 2 /(r 1 r 2 ) > e\ 

Notice that if d.2, e-i satisfy these inequalities then 



Vsis 2 tit 2 ^ rir2d 2 e 2 
Consequently we can choose d2, £2 satisfying ^ei^i^i = P- □ 

Corollary 24.30. J/ f, g, h, k safr's/y ( |24.9.2t and d24.9.3t and f fras degree 2 f/zen 
I f ? J is a stable polynomial. 

Proof. This follows from the lemma and Lemma 121.641 □ 

Remark 24.31. If the determinant is always negative (rather than positive) then 
the corollary fails. The following polynomials form an interlacing square with 
everywhere negative determinant that is not stable. 



(x + 0.9) (x + 0.97) (x + 9.5) < (x + 0.95) (x + 2.9) 
A A 
(x + 0.94) (x + 2.6) < x + 1.5 
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Glossary of polynomials, spaces, 
and interlacing 



Polynomials 

A n Euler polynomial 

C£ Charlier polynomial with parameter a 

C n (x;y) (-lrOU-x) = (-l)-n!L- x — (y) = (-l)^u!L n (-y; -x - n) 

Gn"' Gegenbauer polynomial with parameter a 

H| Hermite polynomial 

H : Hermite polynomial in d variables 

L n Laguerre polynomial 

L n monic Laguerre polynomial: n!L n 

L£ Laguerre polynomial with parameter a 

L n (x;y) Ui(-x) 

P n Legendre polynomial 

((x-a)A- 1 -p) n (l) 

T n Chebyshev polynomial 

(x) n falling factorial x(x — 1) • • • (x — n + 1) 

(x) n rising factorial x(x + 1) • • • (x + n — 1) 

(x;q) n H-x){l-qx)---{l-q n - 1 x) 
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Sets of polynomials in one variable 

P All polynomials in one variable with all real roots. 

P The analytic closure of P. 

ppos j^l polynomials in P with all negative roots and all positive (or all 

negative) coefficients: p(~°°' ' 

PP°" The analytic closure of P p ° s . 

P alt All polynomials in P with all positive roots and alternating 
coefficients: P 10 ' 00 ' 

P Qlt The analytic closure of P alt . 

p± polt |j ppos 

P 1 All polynomials in P with all roots in an interval I. 

P sep Polynomials in P whose roots are all at least one apart. 

Y^p The analytic closure of P s ep . 

3 Polynomials f + ig where f < g. 

3 Polynomials f + ig where f < g. 

3 Analytic closure of 3. 

"K\ Stable polynomials with real coefficients 

"K° Interior of JC i 

JC Analytic closure of Jfj 

"K\ (C) Stable polynomials with complex coefficients 

5ci ( C ) Analytic closure of J£i ( C ) 
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Sets of polynomials in several variables 



u d 


Polynomials in d variables non-vanishing in the upper half plane 


U d (C) 


Polynomials in d variables and complex coefficients non- 




vanishing in the upper half plane 




Pnlimnmialc in H ^fariahl^c nnn-¥anit;liini7 in fhf=* ricrnt naif nlpinp 

A KJ1 y 1 LUllLlcAlo 111 LI V al laulta null V al L1£>1 111 lii 111 Lilt: lliillL 1LC111 IJ1C11LC 




Polynomials in d variables with complex coefficients non- 




vanishing in the right half plane 


n 


UdlQnJfd 


Pd 


Polynomials in d variables satisfying degree conditions, substitu- 




tion, and homogeneous part in P d -i 


1 d 


i oiynoniiais in trie closure or r d 


Pd 


The analytic closure of Pa 


pes 
d 


Polynomials in with all positive coefficients. 


ppos 
1 d 


Pol vnomials in the closure of P , 

A KJl V A LVAAA LAtAAt^ AA L LA IL- V, 1 V J LAA V — VAA JL J 


1 d 


ine diidiytic closure or i ^ 


^d,e 


Polynomials satisfying substitution for x e K, y ) 0, the degree 




condition, and homogeneous part in P d+e . 


«V 


Polynomials in the closure of ^Ve- 


^d,e 


The analytic closure of ^a,e- 


Qd,e 


Extension of P^° s by e variables subject to restricted substitution 




conditions. 




stable polynomials with real coefficients in d variables 


Md 


analytic closure of IKd in d variables 




Polynomials in P2 whose solution curves are all distinct 


psep 


Polynomials in P 2 whose substitutions are all in P sep . 
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Kinds of interlacing 





9 


f + U9 G U d (C) 




f 


9 


f _|_ i m cz <U , If) 






9 










f + ag e P for a ^ 




, H 
T ~ 9 




f + ag e "K d for a ^ 




f ~ g 




f + ag e for a > 




f<g 




The roots of f and g alternate, and deg(g) 


+ l = deg(f) 


f < — 


9 


The roots of f and g alternate, deg(g) = 
root belongs to f 
f < g or g <C g in P 


deg(f), and the largest 
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Tables of transformations 





f 


T(f) 


Domain 


Range 








Multiplier transformations P > P 












Binomial 


x n 




P 


p 


|681| 


d odd, k 


1,2. 


Binomial 


x n 


(nd) k x n 


P 


p 


|681| 


cl odd, k = 


1,2. 


Derivative 


x n 


nx 11 - 1 


P 


p 


E5] 






Exponential 


x n 


X /TV. 


P 


p 


\192\ 






Exponential 


x n 


x n n!/(ku)! 


P 


p 


|192| 


k= 1,2,.. 




Exponential 


x n 


x n /(kn)! 


P 


p 


|192| 


k= 1,2,.. 




q -Exponential 


x n 


x n /\nV 

A, / L 1 L J ■ 


P 


p 


|170| 


q >1 




q -series 


x n 


n ( ?)-v n 


P 


p 


|170| 


|q|<l 




Rising factorial 


x n 


V. 


P 


p 


|288| 






Transformations P ► P 














Derivative 


f 


f 


P 


p 


|495| 






Falling Factorial 


(S)n 




P 


p 


|185| 






Hermite 


x n 


H n 


P 


p 


|188| 






Hermite 


H n 




P 


p 


|188| 






Laguerre 


x n 


Ln 


P 


p 


|191| 






Laguerre 


X n 


TREV 
n 


P 


p 


|496| 






Laguerre 


x n 


L£ 


p 


p 


|496| 


k = 1,2 




Legendre 


x n 


Pn/n! 


P 


p 


|498| 






Rising Factorial 


(2c> n 


(*)n+l 


p 


p 

1 


11 SRI 






Transformations P pos ► 


P 












Affine 


x n 




ppos 


p 


|250| 






Chebyshev 


x n 


-p REV 


ppos 


p 


|203| 






Laguerre 


x n 


U REV (-x) 


ppos 


ppos 


|277| 






Transformations P pos ► 


ppos 












Charlier 


pa 




ppos 


ppos 


|189| 


7 




Falling factorial 


(*) n 


x n 


ppos 


ppos 


|182| 






Falling factorial 




<*>n 


ppos 


ppos 


|183| 






Hurwitz 


x n | 


x n/2 n even 


ppos 


ppos 


|199| 










n odd 










Rising factorial 




<*>n 


ppos 


ppos 


|183| 







Risine Factorial x n (x) /n! P pos P pos [1851 
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f 


T(f) 


Domain 


Range 






Transformations P 


alt 


P 










Affine 


X U 


(_l)UJ x n 




P 


12501 




Chebyshev 


x n 


"T" REV 

'n 


-nfl 1 t 


P 


12031 




Hermite 


x n 


HnL^ 


-r»n 1 1 

pu. L L 


P 


14961 




Hermite 


x n 


(H n ) 2 


pu. L L 


P 


14961 




Laguerre 


x n 


(En,) 2 


-r»n 1 1 

pu. L L 


P 


14961 




Transformations P 


alt 


palt 










Binomial 


X n 


o 


palt 


palt 


H941 




Binomial 


X n 




palt 


palt 


I68T1 


d even, k 


Binomial 


X n 


(TLd) k X n 


palt 


palt 


16811 


d even, k = 


Charlier 


X 


C n 


palt 
1 


palt 






Falling factorial 


Tl 

X 




palt 


palt 


11 Oil 

11821 




Legendre 


n 

X 


PMlxJ/n! 


Tjalt 

r 


Tjalt 

Jr 


12021 




Rising factorial 




v n 
X 


palt 
1 


palt 
1 


11 QOI 

[iooj 




Transformations P 


1 ^P 


I 










Charlier 


X n 


REV ptX 


p(— 1/0) 


P 


14421 




Chebyshev 


x n 


Tn 


i i ii 
pi— 1,1) 


i ill 
pi — 1,1) 


16151 




Euler 


x n 


A n 


i 1 m 
pi — 1/UJ 


palt 


11971 




Euler-Frobenius 


x n 


Pn 


p(— W) 


p(— 1,1) 


11981 




Factorial 


x n 


nrfi - kxi 


p(0,l) 


p(0,D 


|442l 




Falling Factorial 


X 1 




p(0,l) 


p(0,D 


14421 




Laguerre 


x n 


REV f («) 
i-n 


p(0,l) 


P 


14421 


a ^ -1, 


Q-series 


x n 


(x;q)n 


in i ^ 

p m WW- 1 - J 


in 1 i 


12441 


< q < 1, 


Q-series 


x n 


(x;q)n 


Ir\ 1 i 

p(U/i) 


Id 1 1 

plU,l) 


[2441 


1 < q, 


Q-series ( 


x;q) n 


x n 


pK\(0,l) 


pR\(0,l) 


12441 


1 < q, 


Q-series ( 


x; q) n 


x n 


p(0,l) 


p(0,D 


12441 


< q < 1, 


Rising Factorial 


x n 


(x) n /u! 


p(l,oo) 


p(l,oo) 


H851 




Transformations P 


(n) — , 


•P(n) 










Polar derivative 


X 1 


(n — i)x x 


P(n) 


P(n-l) 


H041 




Reversal 


X 1 


x n-i 


P(n) 


P(n) 


1T731 




Mobius 


X 1 


xHl -x) 11 - 1 


P(n) 


P(n) 


1T461 
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f 


T(f) 


Domain 


Range 




Transformations P r (n) 


—v P 1 fn 1 








Binomial 






P pos (n) 


P pos (n) 


|171| 


Binomial 


X 1 


rr l ) 


P (o,i) (n) 


P alt (n) 


|194| 


Chebyshev 


X 1 


(Tr) 1 


p(-U) (n) 


p(-W)( nr ) 


|168| 


Factorials 




(*)n-k 


ppos 


ppos 


|184| 


Falling factorial 


x i 




P pos (n) 


P pos (n) 


|171| 


Falling factorial 


X 1 




P pos (n) 


P pos (n) 


|184| 


Falling factorial 


x i 




P pos (n) 


P pos (n) 


|184| 


Hermite 


X 1 


Hi Hrx-i 


P pos (n) 


P(n) 


|277| 


Hermite 


X 1 


x n - l Hi 


p(2,oo) (n) 


P(n) 


|277| 


Hermite 


X 1 


x n - i H i 


P pos (n) 


P(n) 


|277| 


Laguerre 


x i 


x n - l U 


P pos (n) 


P(n) 


|277| 


Laguerre 


x i 


U L n _i 


P pos (n) 


P(n) 


|277| 


Rising Factorial 


X 1 




P (tu) (n) 


P pos (n) 


|437| 


Rising factorial 


X 1 




P pos (n) 


P pos (n) 


|184| 


Affine transformations with Ax = x + 1 








Binomial 


x n 


(x+ d-n) d 


palt 


psep 


|227| 


Charlier 


x n 


pa 


p( — a,oo) 


p( — a,oo) p. psep 


|228| 


Derivative 


f 


f 


psep 


psep 


|227| 


Difference 


f 


Af 


psep 


psep 


|227| 


Falling Factorial 


x n 




palt 


palt p. psep 


|228| 


Rising Factorial 


x n 




ppos 


ppos p psep 


|228| 


Transformations of stable polynomials 








Charlier 


x n 


pa 

R 


5Ci(C) 




|646| 


Charlier 


x n 


pa 

TL 


3tj 




|646| 


Difference 


f 


f(x + l)-f(x) 


IR'l 




|648| 


Hermite 


x n 


r n H n (-ix) 




•Ki 


|629| 


Laguerre 


x n 


i n L n (rx) 


[K] 


!Hi 


|629| 


Rising factorial 


x n 


<*} n 


JCi(C) 


JC a (C) 


|645| 


Rising factorial 


x n 


<*} n 


[Kj 


J£i 


|645| 


Falling factorial 


(*)n 


x n 


Mi(C) 


Mi(C) 


|646| 


Falling factorial 




x n 


Mj 




|646| 


Hurwitz 


x n 


J x n/2 n even 




ppos 


|646| 






[0 nodd 




Transformations in two or more variables 








Exponential 


x x y> 


Ci+j)! 


ppos 
1 2 


ppos 
1 2 


|413| 


Evaluation 


f X x 1 


fdiDx 1 


ppos x pPOS 


ppos 
1 d 


|349| 


Polar derivative 


f 


(x-d)f 


ppos 
1 d 


ppos 
1 d 


|348| 



a > n-2 > 



a > n-2 > 



d = 1,2,.. 



ae RHP 
ae RHP 



Empirical tables of transformations 



In the table on the next page we list the results of some empirical testing. 
We constructed ten transformations for each polynomial family. We use the 
abbreviations 



The transformation did not appear to map any interval to P. 

X The transformation appears to map P x — > P. 

K The transformation appears to map p' -00 ' 00 ' — > p. 

R + The transformation appears to map p'°'°°> — > p. 

R~ The transformation appears to map p ( ~°°' ' — > p. 

I The transformation appears to map P' -1 ' 1 ' — ► P. 

I + The transformation appears to map P' 0,1 ' — > P. 

I - The transformation appears to map p' -1 ' ' — > p. 



An underlined entry such as [—1,1] means that the transformation is 
known to map p' -1 ' 1 ' — > P. The other entries are unsolved questions. 

The polynomial families in the first column are in the glossary, except for 
the last three. The Laurent-X polynomials are defined by p_i = 0, po = 1 and 

Laurent-1 p n+i = x(p n -p n _!) 

Laurent-2 p n+i =x(p n -np n _i) 

Laurent-3 Pn+i = (x-n)p n -nxp n _i 

There are some cavaets on a few entries. 

P n x n I— > PnX d ~ n proved for (—oo,0) 
(*) x n i ► p n only proved for each half 
(x;l/2) x n i ► Pti/tl! perhaps it's (— oo, 0) U (0, oo) 
The second table lists the actions of just a single transformation x n i— > 
polynomial on various subsets of the complex plane. 
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Pn 


Pn X 
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^.n n rev 

* Kn 


X 

i 

Pn 


,,n 
X 

1 
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1 

Pd-n 


X 

1 

VnVd-n 


TV 

X 

1 

PnX d - n 


„ , TT 

X 

1 
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TV 

X 

1 

n!p n 
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M+ 
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I 
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I 
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1+ 
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I 
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R+ 
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|£- 
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R+UM- 


R-I+ 


si 

R+ 




Laurent-2 


K+ 


Si uK ~ 










t+UR" 


R+ 




Laurent-3 


si 


I 






I" 


R+UR- 


I 


R+ 


.1+ 



EMPIRICAL TABLES OF TRANSFORMATIONS 712 



X n 1 ► 


Qi 


Qn 


Qui 


Qiv 


5ti 


JCi(C) 


3 P 


tP 


H n 


-3 


-3 


3 


3 


P 




3 P 




L n 


Qn 


Qn 


Qui 


Qui 




JCi(C) 


Qm P 


5Ci(C) 


Pn 


-3 


-3 


3 


3 






3 




Tn 


-3 


-3 


3 


3 






3 




A n 


















r (2) 


-3 


-3 
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3 
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(S)n 
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<*>n 










IH'l 








(n) 
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— Jfl 
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-JCi(C) 




— Jfl 






-JCi(C) 


Exp 


Qjl 


Q» 
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jP 


2^x n 


















(l/4)(?) x n 
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Qn 


Qui 


Qiv 




JCi(C) 
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Laurent — 1 
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Laurent — 2 
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Laurent — 3 



















Tables of determinants and 
integrals 



We assume that all polynomials in one variable have positive leading coeffi- 
cients. 



a* a k+ i 
aic-i Qk 



a 


i 




-i 


b 


i 


bi+ 


i 




f 


9 






f 


9' 






f 


9 






h 


k 




f 


Af 


Af 


A 2 f 


f 


Af 


Af 


A 2 


f 


f 


Af 


Af 


A 2 f 






f( 


X 


f 


1) 


f( 


X 



dij+i dt+ij+i 

f fx 

*v 'xy 

foo fio 

foi fn 

l/f(xi,yi) l/f(x 2/ yi 

l/f(x 2 ,-yi) l/f(x 2 ,-y 2 ) 



< where ^J 1 atx^ e P has all distinct roots. If they 11061 
are not all distinct, then it is still < unless = 

and a k _iQ k+ i = 0. 



< where f = Y. Q i xl / 9 — X! btx l ,and f <C g. 
<0 where f < g or f<C g. 



HE] 



< where deg(f) = deg(g) + 1 = deg(h) + 1 = |27| 
deg(k) + 2 and for all all a, (3, not both zero, we 
have that af + (3g < ah + (3 k 



< for positive x if Ax > x and f e P pos n F A 
> for positive x if Ax < x and f e P pos n V A 

< for positive x if f e P pos n P sep 

< for positive x and f e P pos n P sep 



12191 
12191 
12191 
12291 



<0 f(x,y) = X dyxV 'is inP 2 andf(x,0) has all dis- 12961 
tinct roots. 



^0 feP d , d>2. 



13181 



^0 f G P d+2 andf(x,y,z) =f o(x)+f 10 (x)u+foi(x)z+ EH 
fn(x)yzH 



> f e P 2 



14581 
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Next, we have examples of three by three determinants, all of which are 
special cases of n by n determinants. All these matrices are totally positive for 
non-negative x. 



0-2 0.1 d 

a 3 a 2 ai 
a 4 Q3 a 2 



Y_ aix 1 e P pos HH 



h fi f 
h h fi 

f4 f3 f 2 



£fi(xhj i eP 2 EMI 



lf(3) if" 

3! 2! 

lf(4) l f (3) 

4! ' 3! ' 



f 

f 

J_f " 
2! 1 



f e P pos 1291 



±f(x + 2) f(x + l) f(x) 
if(x + 3) if(x + 2) f(x + l) 
±f(x + 4) ±f(x + 3) ±f(x + 2) 



f e P pos HH 



27 ( x 



2) n (x + l) n x n 
i(x + 3) n ^(x + 2) n (x + l) n 
i(x + 4r ^(x + 3) n ±(x + 2)' 
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f 1 

f(x + vt)dteP. feP 

. -i 
pi 

f (x + ait) dt e 3 f e P and a > 

. o 
f 1 

f (x + it) dt e 3 f e P 

. o 
pi 

f (x + 1) dt e "Ki f e JCi 

. o 

pi ^ 
<()(x)e Ul dx e 5. cf)(x) T and positive on (0, 1) 

. o 
pi 

f(x,y +it) dt G 3 2 f G p 2 

. o 
pi 

f(x,t)dteP fGPfP 

. o 
pi 

f(x,t)g(x,-t)dteP f,geP2 ep 

. 

pi pl-t 

f(x,t)g(x,s)dsdte P f,geP s 2 ep 



Tables of Polynomials 



Appell polynomials in one variable 

Ao(x) = 1 

Ai(x) = l-2x 

A 2 (x) = l-6x + 6x 2 

A 3 (x) = l-12x + 30x 2 -20x 3 

A 4 (x) = l-20x + 90x 2 -140x 3 + 70x 4 

A 5 (x) = 1 -30 x + 210 x 2 - 560 x 3 + 630x 4 - 252 x 5 



Appell polynomials in two variables 



Ao,o(x,y) =1 

Ai /0 (x,y) =l-2x-y 

A ,i(x,y] =l — x-2y 

A 2/0 (x,y) = l-6x + 6x 2 -2y +6xy+y 2 

A u (x,y) = l-4x + 3x 2 -4y + 8xy+3y 2 

Ao, 2 (x,y) = 1 — 2x + x 2 — 6y + 6xy + 6y 2 

Charlier with parameter 1 

C (x)= 1 

d(x)= -1 + x 

C 2 (x) = l-3x + x 2 

C 3 (x) = -l + 8x-6x 2 + x 3 

C 4 (x) = 1 - 24x + 29x 2 - 10x 3 + x 4 

C 5 (x) = -1 + 89x - 145x 2 + 75x 3 - 15x 4 + x 5 

Charlier in two variables 

C n (x;y) = (-ira(-x) 
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C (x;y) = 1 

Ci(x;y) = x + y 

C 2 (x;y) = x + x 2 + 2xy + y 2 

C 3 (x;y) = 2x + 3x 2 + x 3 + 3xy + 3x 2 y + 3xy 2 + y 3 

C 4 (x;y) = 6x + llx 2 + 6x 3 + x 4 + 8xy + 12x 2 y+4x 3 y + 6xy 2 + 6x 2 y 2 +4xy 3 +y 4 

C 5 (x;y) = 24x + 50x 2 + 35x 3 + 10x 4 + x 5 + 30xy + 55x 2 y + 30x 3 y + 5x 4 y + 
20xy 2 + 30x 2 y 2 + 10x 3 y 2 + 10xy 3 + 10x 2 y 3 + 5xy 4 + y 5 



Chebyshev 





T n (x) 


T (x) = 


1 


Ti(x) = 


X 


T 2 (x) = 


-1 + 2x 2 


T 3 (x) = 


-3x + 4x 3 


T 4 (x) = 


1 - 8x 2 + 8x 4 


T 5 (x) = 


5x - 20x 3 + 16x 5 


Euler 






A n (x) 


Ao(x) = 


1 


Ai(x) = 


X 


A 2 (x) = 


X + X 2 


A 3 (x) = 


x + 4x 2 + x 3 


A 4 (x) = 


x + llx 2 + llx 3 + X 4 


A 5 (x) = 


x + 26x 2 + 66x 3 + 26x 4 + x 5 



Falling factorial 

(x) =1 

(x)i = x 

(x) 2 = — X + x 2 

(x) 3 = 2 x - 3 x 2 + x 3 

(x) 4 = -6 x + 11 x 2 - 6 x 3 + x 4 

(x) 5 = 24 x - 50 x 2 + 35 x 3 - 10 x 4 + x 5 

Falling factorial inverse (Exponential or Bell polynomials) 

BnM 
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BoW =1 

BjM = x 

B 2 (x) = x + x 2 

B 3 (x) =x + 3x 2 + x 3 

B 4 (x) =x + 7x 2 + 6x 3 + x 4 

B 5 (x) =x + 15x 2 + 25x 3 + 10x 4 + x 5 



Gegenbauer 

G n (x) 



bolxj - 


_ 1 
- 1 








= 12x 2 - 2 


G 3 (x) = 


= 32x 3 - 12x 


G 4 (X) = 


= 80x 4 - 48x 2 + 3 


G 5 (x) = 


= 192x 5 - 160x 3 + 24x 


Hermite 




Ho(x) = 


1 


Hi(x) = 


2x 


H 2 (x) = 


-2 + 4x 2 


H 3 (x) = 


-12x + 8x 3 


H 4 (x) = 


12 - 48x 2 + 16x 4 


H 5 (x) = 


120x - 160x 3 + 32x 5 


Laguerre 




Lo(x) = 


1 


Li(x) = 


1-x 


L 2 (x) = 


i(2-4x + x 2 ) 


L 3 (x) = 


1(6- 18x + 9x 2 - x 3 ) 



H n (x) 



U(x) 



U(x)= 2i (24 - 96x + 72x 2 - 16x 3 + x 4 ) 

L 5 (x) = ^ (120 - 600x + 600x 2 - 200x 3 + 25x 4 - x 5 ) 



Q-Laguerre 

1 

-x + 1 

x 2 - 2[2]x + [2] 

-x 3 + 3[3]x 2 -3[2][3]x+[2][3] 
x 4 - 4[4]x 3 + 6 [3] [4]x 2 - 4 [2] [3] [4]x + [2] [3] [4] 



KM = 
If (x) = 
l£(x) = 
Lf (x) = 
Ii(x) = 
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Laguerre in two variables 

U(x;y) = LK(-x) 

Lo(x)= 1 
Li(x;y) = 1 + x + y 

L 2 (x;y) = i(2 + 4x + x 2 + 3y +2xy +y 2 ) 

L 3 (x;y) = 5(6 + 18x + 9x 2 + x 3 + 11 y + 15 xy + 3x 2 y + 6y 2 + 3xy 2 + y 3 ) 
U (x;y) = ^(24 + 96x + 72x 2 + 16x 3 +x 4 + 50y + 104xy +42x 2 y +4x 3 y + 

35y 2 + 36xy 2 + 6x 2 y 2 + 10y 3 + 4xy 3 + y 4 ) 
U (x;y) = 1^(120 + 600x + 600x 2 + 200x 3 + 25x 4 + x 5 + 274y + 770xy + 

470x 2 y +90x 3 y +5x 4 y +225y 2 + 355xy 2 + 120x 2 y 2 + 10x 3 y 2 + 

85y 3 + 70xy 3 + 10x 2 y 3 + 15y 4 +5xy 4 +y 5 ) 

Legendre 

Pn(x) 

P (X)= 1 
PlM = X 

P 2 (x)= I(-l + 3x 2 ) 

P 3 (x)= i(-3x + 5x 3 ) 

P 4 (x)= |(3 - 30 x 2 + 35x 4 ) 

P 5 (x)= |(15x - 70x 3 + 63x 5 ) 

Narayana 

N n (x) 

N (x)= 1 

Ni(x)= x 

N 2 (x) = x 2 + x 

N 3 (x) = x 3 + 3x 2 + x 

N 4 (x) = x 4 + 6x 3 + 6x 2 + x 

N 5 (x) = x 5 + 10x 4 + 20x 3 + 10x 2 + x 

Rising factorial 

<X>n 

(x) =1 

(x)i = x 

(x) 2 = X + X 2 

(x> 3 = 2 x + 3 x 2 + x 3 

(x) 4 = 6 x + 11 x 2 + 6 x 3 + x 4 

(x> 5 = 24 x + 50 x 2 + 35 x 3 + 10 x 4 + x 5 
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Rising factorial inverse 

<*>n _1 

(x)o- 1 =1 

(x) 2 _1 = — X + X 2 

(x) 3 _1 = x - 3 x 2 + x 3 

(x) 4 _1 = -x + 7x 2 -6x 3 +x 4 

(x)^ 1 = x - 15 x 2 + 25 x 3 - 10 x 4 + x 5 



Questions 



There are many unsolved problems about polynomials. 
Polynomials with all real roots 

Question 1. Is the minimum of the roots of the Hadamard product of polyno- 
mials in P' ' 11 (n) attained at f = g = (x + l) n ? 

Question 2. Show that for all n the falling factorial (x) n is the derivative of a 
polynomial with all real roots. (See ll64t 

Question 3. Find all polynomials h(x,y) such that for all f<Cg we have 
h(f,g)eP. 

Question 4. Show that f n/ g n defined below satisfy f n < — f n-i an d that 
g n < — g n -i/ where ( i TL i ) and (•^•j are multinomial coefficients. In addi- 
tion, f n <f n -2 and g n < g n -3- (There is the obvious generalization.) Corol- 
lary [TJ15] shows that f n e P. 




Question 5. Suppose n is a positive integer. Show the following interlacings, 
where B, L, N, E are the Bell, Laguerre, Narayana and Euler polynomials. 

exp -1 B n +i < — exp -1 B n 
exp -1 L n+1 < — exp -1 L n 
exp -1 N n+1 < — exp -1 N n 
exp -1 E n+ i < — exp -1 E n 

Question 6. Under what conditions does $5 a ^ b imply there is a g such that 
poly(c) ^ g for all a < c < b? 

Question 7. (C. Johnson) Given f, g e P, not necessarily interlacing, what is the 
maximum number of complex roots of linear combinations of f and g? Can 
this number be computed from the arrangement of the roots of f and g? 

Question 8 (The Faa di Bruno problem - (See p. [39])). Suppose that g G P. For 
any oc € R and m = 0, 1, . . . , 

1. E m (oc,u) has all real roots. 

2. F m+1 (a,y) <F m (a,y). 
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Polynomials with all positive coefficients 

Question 9. If T: P pos — > P, then what constraints are there on the signs of 

T(x v )? The example x l i— > (— l)^x l shows that the pattern h+ is possible. 

In addition to all signs the same, and all signs alternating, are there any other 
possibilities? 

Question 10. If £ c^x 1 e P pos and a > 1 show that £ afx i € P pos . See ED 
for a questionable argument. Compare Ouestion ll32l 

Question 11. If f e pp° s / consider the set S = {g | f ~ g}. Unlike the case 
where ~ is replaced by < , the set S is not a cone. However, it is a union of a 
collection of cones. What is a usefful description of S? 

Matrices that preserve interlacing 

Question 12. If M = (fy ) is a matrix of polynomials with positive leading 
coefficients that preserves mutually interlacing polynomials then does (fy ) 
also preserve mutual interlacing? If so, the following would be true: 

Write M — Y. M-i x 1 where each Mi is a matrix of constants. Each 
Mi is totally positive2- 

Question 13. If (fi, . . . , f n ) is a sequence of mutually interlacing polynomials, 
then is {(at) S R n \ Y_ a ih € P} a convex set? 

Question 14. Describe all matrices M of polynomials with the property that 
if v is a vector of mutually interlacing polynomials then the vector Mv also 
consists of mutually interlacing polynomials. 

Question 15. A question related to the previous one. If v is fixed, describe all 
matrices M such that M x v is mutually interlacing, for all positive integers i. 

A special case. Find all M = ( I 2 ) suc h that M 1 ( J ) consists of interlacing 
polynomials for all positive integers i. 

Question 16. If f < g and fF — gG = 1 where deg(F) = deg(G) — 1 = deg(f) —2 
then are F and G unique? 

Question 17. Find all d by d matrices that are P-positive semi-definite. That is, 
iff e Pis^aiX 1 , andA = (a , . . . , a d _i), then AQA* ^ 0. 

Are all P-positive semi-definite matrices the sum of a positive semi- 
definite matrix and a P-positive semi-definite matrix whose non-zero entries 
are only on the anti-diagonal? 

Homogeneous Polynomials 

Question 18. Given f g P, are there a, b, c € K and g e P such that 
(" + b£ + c4)g = f? 

Question 19. If T: P pos — ► P pos then when is cp(T) totally positive? Itappears 
to be so for the Laguerre transformation. 
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Analytic functions 

Question 20. Which hypergeometric functions are in P? 

Question 21. Which polynomials f (x,y) have the property that f (x, sinx) £ P? 
How about f(x,e x ) e P? 

Question 22. Is there a q -analog T q of the gamma function for which F q (z + 
l)/r q (kz + 1) is entire, and only has negative zeros? 

Question 23. If f e P, and for all a e K we know that f + a e P then show that 
f = ax + b. This is trivial when f is a polynomial. 

Linear Transformations of polynomials 

Question 24. Characterize all linear transformations T (see 1 16.3. It ) defined by 
T(x n ) = nr=i( x + a t) that map P pos to P. 

Question 25. Can we characterize transformations T that satisfy 

J. ppos y ppos 

T- 1 . palt y palt 

Question 26. Can we characterize transformations T that satisfy T: P alt — ► 
P pos and T 2 is the identity? The La guerre transformation is such an example. 

Question 27. Characterize all linear transformations T : P — ► P that commute 
with differentiation (DT = TD). 

Question 28. Suppose that V is the infinite dimensional vector space of all 
linear transformations from polynomials to polynomials, and W is the subset 
of V consisting of all linear transformations that map P pos — > P. Then W 
contains no open sets. 

Question 29. If T(x n ) = d n x n satisfies T(P pos ) c P pos then does T preserve 
interlacing? 

Question 30. If T : P — ► P is a linear transformation then describe the possi- 
ble sets S = {deg(T(x)), deg(T(x 2 )), deg(T(x 3 )), ■ • •}. Since T(x n ) and T(x n+1 ) 
interlace, we know that adjacent terms of S differ by at most 1. The example 
f i— > f ( D) g shows that S can be bounded. 

Question 31. Can we find linear transformations T such that 

1. T: P — ► P. 

2. Tfx 1 ) = x l for sC i < r 

3. T(x T ) ^ x T 

x n i ► HrJx) works for r = 2. 
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Linear transformations that preserve roots 

Question 32. Show that 

1. T(x l ) = (x) i (x) n _ t maps P pos — ► P pos . 

2. T(x l ) = (x) i (x) n _ t maps P pos — ► P. 
Question 33. Lie and Wang [117] showed that 

B n +i B n+ 2 
B n B n+ i 

has all positive coefficients where B n are the Bell polynomials (p. 11891 . Show 
that the determinant is actually stable. 

Question 34. Show that the linear transformation x 1 i— * rL(x)x n ^ maps 
P ( -°°' 2) (u) — >P. 

Question 35. If T(H n ) = x n , then T does not map P, P Qlt or P pos to P. Show 
that EXP o T and T o EXP both map P to itself. 

Question 36. Show that the transformation x k i— > Hfl where Hn is the q- 
Hermite polynomial maps p' — — > P for q > 1. 

Question 37. Suppose n > 1 + rt_i are positive constants. Show that the 
transformation x n i— > riT=i( x — r maps P 10 ' 11 — > P. 

Question 38. Show that the transformation x 1 1— > REV , the reverse of the Leg- 
endre polynomial, maps P alt U P pos — > pp° s xh e transformation satisfies 

?n -I- 1 x 2 
T(X~ +1 ) = f^. T(x n) _ * T((x n n 

n + 1 n + 1 

Question 39. Show that the transformation x l i— > P^, where Pt(x) is the Legen- 
dre polynomial, maps P 1-1 ' 1 ' — > P' -1 ' 1 '. 

Question 40. For which cc, |3 is it the case that T: P — > P, where T: x 1 i— > — h— , 
and P"' 13 (x) is the Jacobi polynomial? 

Question 41. If T is the linear transformation x n T n and Mz — then 
T M : P ( ~ u) -> P pos - See Lemma ESU 

Question 42. Show that the transformation Ei i— » x 1 , where is the Euler 
polynomial, maps P' 1 ' 00 ' — ► P alt . 

Question 43. Show that the exponential diamond product f g = 

EXP 

EXP -1 (EXPf x EXPg) is a bilinear map P pos x P alt — ► P. This map is 
equivalent to x l Ox' = ( i+i )x i+ >. 

EXP 1 

Question 44. Show that if T(x x ) = Lt(x), the Laguerre polynomial, then the 
Laguerre diamond product f g = T~ : (T(f ) x T(g)) is a bilinear map p' 1 ' 00 ' x 

p(— 00,1) > p 
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Question 45. If T: x n i— > (x) n then the diamond product fOg = 
T- 1 (T(f) x T(g)) mapsP x P pos — > P. See Lemma 1 11. 1031 

Question 46. We can modify the definition of diamond product by replacing 
multiplication by the Hadamard product. Given a linear transformation T, 
define S by 

f x g i ^ Tf x Tg 

S : Hadamard product 

T- 1 (Tf * Tg) 1 Tf * Tg 

(A) LetT(x n ) = (x) n . Show that S: P* x P± — ► P. 

(B) Show that the above result holds if we use the modified Hadamard prod- 
uct x T *' x T — r! x r . 

(C) If T(Ht) = x\ then show that 

<J. ^p(00,-l) y p(l,C»)j x /p(00,— 1) y p(l,C»)j > p 

Question 47. If T(x n ) = Pm' 3 , the n-th Jacobi polynomial, for which a, (3 does 
T map P~ ' to itself? Some computations show that there are examples that 
do not map p -1 ' 1 to itself, but they are all where a ^ |3 and — 1 < a, |3 < 0. 

Question 48. Show that the transformation x x 1— > Yli^ + i^Y x ) — ( x ;^l)i 
mapsP pos UP Qlt — >P. 

Question 49. For which a, b does the transformation (x; a) 1— * (x; b) map P — ► 
P? 

Question 50. What assumptions on S,T do we need to make the following 
true? 

If S, T are linear transformations that map P pos to itself, then define the 
transformation U on P pos (n) by x i i-> T(x i )S(x n - i )- If S,T map P alt to itself 
then U maps P alt (n) to itself and preserves interlacing. 

Question 51. Determine all orthogonal polynomial families {p n } so that x n 
p n maps P 1 to P for some interval I. 

Question 52. Identify a polynomial f in P (n) with ctf in projective n + 1 space. 
What is 

VolDP(n+l) 
VolP(n) 

Question 53. If T: P — > P, then is it possible to compute ^^nffi^ - What are 
upper and lower bounds for this ratio? Should we restrict ourself to P(n)? 

Question 54. Let all denote all polynomials. If T : P — ► P and V v l J l ^p^ is 

small then ~^Yon|n^^^ is large. 
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Question 55. Suppose that T: P(n) — ► P(n). What are the obstructions to 
extending T to a linear transformation that maps P(n + 1) to itself? 

Question 56. Show that if Tfx 1 -) = (oc) i x 1 where a < n — 2 is not an integer 
then T((x + l) n ) £ P. See LemmaEZl 

Question 57. When is it true that if f € p? ^ then for all positive integers i 

f(i) 2 >f(i-l)f(i + l). 

Corollary [734] shows that the inequality with > in place of > is true. 
Question 58. Suppose that pt is a sequence of polynomials defined by 

Po = 1 
Pi =x 

Pn+1 = Xp n - C n pn-1 

where the c n are constants. If the linear transformation x n i— > p n maps P to 
itself then are the p^ essentially the Hermite polynomials? 

Question 59. If T(x k ) = %^ then show thatT: p^' - 1 ' — ► p*\(M). 

Question 60. If T(x k ) = then show that for any a e (0,1) there is an n 
such that T(x — a) n does not have all real roots. 

Question 61. If T is a linear transformation, and T(x — r) n G P for all positive 
integers n and all r satisfying a ^ r < b, then is it true that T: p' a ' b ' — > p? 

Question 62. Find examples of transformations T and constant a for which 
T(x + a) n is a scalar, for all n. This is satisfied by T(x x ) = ( x + n — t ) 

Question 63. If T: x n (x)n/ n ' then show that T maps polynomials whose 
roots all have real part 1/2 to polynomials with real part 1/2. [142] 

Question 64. In Example II. 851 we saw that the commutator of the two trans- 
formations x n i—* N n and x n i— ► Dx 11 maps the sequence {x 71 } to an interlacing 
sequence. It appears that more is true. An entry in Table |64j such as P at 
the intersection of the row A n and column B n means that empirical evidence 
suggests that 

Ti : x n A n T 2 : x n ^ B n =*> TjT 2 - T 2 Tj : P pos — > P 



Affine Transformations 

Question 65. Show that if f (x) < f (x + 2) then 



f(x+l) f(x) 
f(x+2) f(x+l) 



>0 
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N n An 


Bn 


D 


Ln 


(S>n 




P 


P 


ppos 


ppos 




An 




P 


ppos 


ppos 




Bn 






•K 


ppos 


P 


D 








p(-oo,0] 


ppos 



ipos 



Table 1: Purported range of P pos under the commutator map 

Question 66. If T(x n ) is H u (x),x n /n!, or L n [x) then show that the minimum 
separation is achieved at the falling factorial. That is, if f € P sep show that 

6T(f)^6T(x) n . 

Question 67. Is (x) i i-> (x) n -i abijection on P sep (n)? 

Question 68. Show that x n i— > L n (L n is the Laguerre polynomial) maps P sep 
to itself. 

Question 69. Show that (x) n i-> x n maps P alt to P sep . 
Question 70. Suppose all r\ are positive and define 

n 

T(x Ti )=n(x- T o 

If T(x n ) e P sep thenT: P' ' 1 ' — ► P' 1 ' 00 ' n P sep . 

Question 71. If T: P sep — ► P sep satisfies f <Tf then is Tf = <xf + (3A(f )? 
Question 72. If 6(f) > 1 then is there a g with all real roots such that A(g) = f ? 

Question 73. Since ni=o — 2i) is a polynomial of degree 2n in P sep , we 
know that 

2n-l 

Pn=A n ]^[(x-2i)eP sep (n) 

i=0 

Although it is not the case that p n <p n -i, it appears to be nearly true. Show 
that if g n = p n M/(x — a) where a is the largest root of p n thenp n -i ^g n - 

Question 74. Which series f (x) = Y.'o a i xX have the property that f * g is in P A 
if g € P A ? This is an affine analog of Theorem 1 15. 221 

Question 75. If f has all its roots in (—1, 0) and T: x n — ► (x) n then the roots of 
Tf satisfy: the largest is larger than all roots of f , and the rest are all less than 
-1. 

Question 76. Show that the map x n — > (x) n is a bijection between P pos and 
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Question 77. For any A is there an infinite family of polynomials h n so that 

• D A h n = (const)h n _! 

• h n e p a 

Question 78. If {ai} is a sequence such that g < Y_ tug' 1 ' whenever g is (x)^ 
then do we have interlacing for all g S P sep ? 

Question 79. If f is a polynomial such that f (D A )P A c P A then is f e P A ? 

Question 80. Does Tf = xf(x/2) — A(f) preserve roots in some appropriate 
space? 

Question 81. If f(x)>g(x), f(x)>f(x + 1), g(x)>g(x + 1) then is 



f(x+l) g(x+l) 



^0? 



Question 82. What are conditions on f that guarantee that | 1, 3£ | is never 



zero? This determinant is also equal to 



f(x) f(x+l) 
f(x+l) f(x+2) 



Question 83. Show that the positive part of the q-Hermite polynomials is in 

pA 

Question 84. For which polynomials does the q-Hermite transformation x n i— ► 
Hn (see § 1818.1211 preserve real roots? 

Question 85. Recall 6(f) is the minimum distance between roots of f. What is 
inff it is 2 for quadratic and 1 for linear. 

Question 86. What is inf f gyjy^jy? 

Question 87. For which polynomials F(x,z) is f"(x,A) : P sep — ► P sep ? How 
aboutF(x,A) : P sep — > P sep ? 

Question 88. Consider the recurrence 

Pn+2 = (a n x + b n + (c n x + d n )A)p n+1 - e n Ap n 

What conditions on the coefficients do we need so that all p n e P sep , and 
Pn+i<Pn for alln? 

Question 89. Show that the real parts of the roots of A k [(x) n (x) n ] satisfy these 
two properties. 

1. The real parts are all integers, or integers plus 1/2. 

2. If k = n + r where r ^ — 1 then the real parts are all equal, and have 
value -(2n + r-l)/2. 

Question 90. Show the following: 

The map fxg^f«g satisfies P sep n P alt x P sep n P alt — ► P sep n P alt . 
(See p.l23i1) 
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Question 91. Find an explicit formula for the q-Laguerre polynomials. It is 
probably similar to the definition of the Laguerre polynomials: 

i=0 v 7 



Question 92. Suppose that f t , gt, h t are locally interlacing families with p(f ) ^ 
1/ p(g) ^ 1/ P(h) 1. Show that the triple convolution below is in P. 

f 1 f 1_t 

ftMg s (x)hi_ s _ t (x) ds dt 

Jo Jo 

Question 93. Suppose that f e P A where Ax = qx with q > 0. Show that if 
f = Y_ ciix 1 then (see !247l l 

a 2 k+ i > 1 [k + 2][n-k] 
QkQk+2 q [k+l][n-k-l]' 

Question 94. Suppose that Ax = qx where q > 1, Bx = q 1/2 x and D^ m f (x) = 
(Af - A^fjAAx - A" 1 *). Show that if f e P A then D^ um f e P B . 

Question 95. Suppose that f <f; A 2 f and the polynomial f + lAf has roots r\ + 
isi, . . .,r n + is n , and g = (x — rj) • • • (x — r n ). Show that 

f < g< Af. 

(It then follows that Af<cAg<cA 2 f. Since f^A 2 f the sequence 
f, g, Af, Ag, A 2 f is mutually interlacing, and hence g € P A .) 

Question 96. Define the q-Charlier polynomials 1125J by 

C n +i(x, a;q) = (x - aq n - [n])C n (x, a; q) - a[n]q n_1 C n _i(x, a; q), 

where C_i(x, a; q) =0 and Co(x, a; q) = 1. If q > 1 and a > 1 then show that 
C n (x, a; q ) is in P A , where Ax = qx. 

Polynomials in P2 

Question 97. If f e Pf p then show that A x (f) = f (x+ l,y) - f (x,y) is in Pf p . 
Question 98. Show that the linear transformation x\j j (x)i(u)- maps P2 to 
itself. 

Question 99. Suppose that we have three polynomials f, g, h e P2 where af + 
bg + ch e ±P2 for all a, b, c e M. Are f, g, h necessarily linearly dependent? 

Question 100. If f(x,y) is a product of linear factors, then for which linear 
transformations T is Tf ever a product of linear factors? 
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Question 101. Suppose that f e P 2 and write 



f = f + fiy +f 2 y 2 + --- + f n y n . 



Consider the d x d determinant 



det d 



fo 
fi 
h 



fi 
f 2 
f 3 



fd-i 
fd 

fd+l 



fd-1 



fd 



f2d-2 



Empirical evidence suggests that 

• If d = 2r is even and d < n then det^ always has the same sign (— l) r . 

• If d is odd then the determinant has positive and negative values. 

This is similar to results for matrices of orthogonal polynomials found in |99]. 
The case where f is in Y^ s is discussed in § 1414.51 

Question 102. (The total positivity conjecture) Suppose that f e P^fn) is a 
product of linear terms, and write f = Y. a ij xX V' ■ When is the matrix below 
totally positive? (Lemma l9.120l is a special case.) 



Question 103. We can form polynomials from the coefficients of a polynomial 
in P2. Suppose that f = Y Oy-xhj 1 . Define M(i, j; d) to be the determinant of 
the d by d matrix (ai+ T ,j+ s ), where ^ r, s < d. For example, M(i, j, 1) = ay, 
and M(i,j;2) = ay 01+1,5+1 - ay+ia i+ y. 
When is Y M(i, j; d)xV € ±P 2 ? 

Question 104. Show that the product of two polynomials in with totally 
positive coefficients has totally positive coefficients. 

Question 105. Suppose f = Y. a i xl where f e P pos (n). The polynomial f {x+y ) 
is in P 2 and 



Qod 



add 



QdO 



n 



f(x + y) = ^aiix + yY 
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Show that the following matrix is totally positive 

(.,. a 3 a 2 Qi a \ 

. . . 3ci3 2a 2 ai 

... 3a 3 a 2 

... a 3 

V •••/ 

This is a stronger statement than Theorem l4.23l 

Question 106. Suppose that f [x,y ) e P 2 is positive whenever x and y are posi- 
tive. This is the case if f e Pi? 05 ; it also holds for e -*y-* 2 . When is 1/f is totally 
positive on ? 

Question 107. The conjecture in [76 1 is that if A — (ay ) is a real n by n matrix 
with non-negative entries which are weakly increasing down columns then 
the permanent of A + xj has all real roots. 

Show that the permanents of all the submatrices of A + xj obtained by 
deleting any row and column have a common interlacing with per(A + xj). 

Question 108. If a linear transformation is defined on P 2 , and T maps all prod- 
ucts Yliv + Q t x + hi) to P 2 , then does T map P 2 to itself? 
Question 109. Which products have coefficients satisfying a recurrence? That 
is, given a product Yll a i + D iU + Ct) — Y. ^ii x )V x Ior which at, bi, Ct do we 
have fi = xatfi-i + |3tfi_ 2 for constants oti, pi.. 

Question 110. Given f e P (n) for which r can we find a product of n + r terms 
ax + by + c so that the coefficient of y r is f? We can always do it for r = 1, 
since all we need do is choose g < f . 

Question 111. If fo < fi < f 2 then is there a polynomial fo + hMv + ^iMv 2 + 
■ ■ ■ + fny n that satisfies x-substitution? 

Question 112. If f G P 2 and n( Q i x + biU + Ci) < f then is 

Question 113. The Legendre polynomials can be defined by P n M = 
-^r, D n (x 2 — l) n . This leads to the guess that the map 

might map P 2 to P for some constants a n/m - This is not true when all cc n rrl 
are equal to 1. For instance, the image of (x + u + 16)(x + 4u +25) under this 
map has complex roots. Is there a choice of constants for which it is true? 

Question 114. What are the different isotopy types of graphs of polynomials 

in P 2 (n)?. 

Question 115. If f e P 2 then let p n be an eigenpolynomial. This means that 
there exists a A n such that f (x, D)p n = A n p n . See Lemma [15. 69 1 



1. When do they exist? 
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2. When is p n € P? 

3. When does p n+ i <p n ? 

Question 116. For which diagonal matrices D and symmetric matrices C is the 
integral below in P2? 

rl 

|xI+uD + e tc |dt 

Question 117. Let C be a function from the reals to symmetric matrices. Define 

f(x,t) = |xI+C(t)|. 

For every t we kare given that f(x,t) € P. What assumptions on C must we 
make so that f e P2? Consider C(t) = e tE , where E is symmetric. The roots 
of f(x,t) are the negative eigenvalues of e tE , and they are {e tAi } where the 
eigenvalues of E are {At}. Is this in P2? 

Question 118. If f (x,y ) e P2, and cr has positive real and imaginary parts then 
we know that the roots of f (x, cr) all lie in the lower half plane. Show that the 
argument of a root p of f (x, cr) satisfies 

\n — arg(p)| ^2arg(cr). 

Question 119. If f e Pf , f = L hMy\ fo = Lk f 2kM, fi(x) = L k WW 
then all the roots of fo and of f 1 have negative real part (they may be complex). 

Question 120. Can we remove the factorials from Corollary I4.28P Suppose 
f e pf° s / d is a positive integer, and a > 0. Then show that 



f(a+d) f(a+d-l) 



f(a + 2d) f(a + 2d-l) 



f(a) 



f(a+ d) 



^0 



Question 121. If d is odd show that the derivative of N(f) (see d4.5.4t ) is posi- 
tive for all x 



Question 122. If d is even show that N (f ) (see (|4.5.4| |) is positive for all x. (Not 
just for positive x.) 

Question 123. Is the set of all extensions of two polynomials in P pos a convex 
set? More precisely is it the convex hull of the n! polynomials arising from 
products? (p. [327) 

Question 124. (The analog of common interlacing for P%) If f, g e P2 and od + |3g € 
P2 for all non-negative a, (3 then is there an h 6 P2 such that f < h and g < h? 

Using the one variable result, it is easy to see that there is a continuous 
function fi such that the graph of h interlaces the graphs of f and g. Can we 
choose such an h to be in P2? 
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Question 125. If f = £ a i,j xl V J G P P men the n — 2 by n — 2 matrix 

has L 1 ^] negative eigenvalues and positive eigenvalues. 

Question 126. (A generalization of Kurtz's theorem(p. |128l l) Suppose that f = 
Y.i.+j< n a i,i xl y i has all positive coefficients. If for all relevant i, j the following 
inequalities hold then f e P 2 . 



Q-i-1,5 a i+l,j Qi,j<li+l,j+l CLy+iCU+y+i 

In other words, if all rhombus inequalities hold with constant 2 then f e P 2 . 
It is a consequence of these inequalities that a?, > 4a.i / j_iay+i and so by 
Kurtz's theorem we know that all ft S P pos where f = £ fiMUv 

Polynomials in several variables 

Question 127. Given f < g in P pos is there £ hij MyV e P^ os such that 



hoo = f hio = g 



hoo hio 
hoi h n 



Question 128. For a fixed polynomial f in P^ what are the possible dimensions 
of the cone of interlacing of f ? 

Question 129. Can every polynomial is Pa be realized as a coefficient of a prod- 
uct of linear terms with a large number of variables? 

Question 130. If f = £ cuMy 1 , g = £ bjMy 1 , f e I*£ e (n) and g e P p d ° + S e (n), 
then when is the "Hadamard Product" below in P^ +e (n)? 

^a.fxMxjy 1 

It appears to hold for d = 0, e = 2, or d = e = 1. If it held for d = e = 1 then 
we could easily derive a consequence due to 1 66 ] : If fo < fi and go < gi are all 
in P pos then fo * go < fi * gi- The proof goes like this: We can find f , g € P^ 
such that 

f(x / v)=f (x)+fi(x)i/ + --- 
g(x,y) = g (x) + gi(x)y H 

By Question[l30] 

fo * go + h * giy H — e P 2 



Since the coefficients interlace, fo * g < fi * g 
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Question 131. The "generalized Legendre" polynomials are of the form 

They do not satisfy substitution for x(s in the unit ball. 

1. Show that the one variable polynomials that are coefficients of monomi- 
als of the form x^Xg 3 • • • x% r are in P, where the ei are positive integers. 

2. If the case r = 2 show that if 

W*W> = (£) 2, (4)V + v>-i)-< 

thenf 2i ,2,(x,y) G P 2 . 

Question 132. If e > 1 and £ Qix 1 G P£", then £ afx 1 G P^ 05 . 

Question 133. Suppose that Qi, . . . , Qd are negative subdefinite matrices with 
the property that for all positive ai, . . . , a d the matrix aiQi + • • • + a d Q d is 
negative subdefinite. (Find a general construction for such matrices.) Show 
thatif f(x) e pP° s (n) then 

(-l) n f(-xQ 1 x t / ...,-xQ d x t ) G P d (2n) 

Question 134. Suppose that T: P d — ► P is a linear transformation with the 
property that monomials map to monomials. Can we characterize such T? 
Are they constructed out of maps from P 2 — > P? And, are these of the form 
xhj' 1 ► z 1+ ', x 1 !)' 1 ► aV, coefficient extraction, and xhj' (j).z i_1 ? For 
instance, if it's true that xhj' i-> (j).z' _l maps P 2 — > P, then we have a map 

(x,i/,v)n(x,x,v) f(x,D) 
P 3 ^ P 2 ^ p 



x i y j v k! ^ x i+j v k| > (i+i) 1c z 1c ~ t ~ i 

More generally, can we describe all multiplier transformations? These are 
linear transformations Pd — ► P e that map monomials to monomials. 

Question 135. Prove the following. Note that the only difficulty is to replace 
the zero entries of Q with small positive values so that there is still exactly one 
positive eigenvalue. There is only a problem if the determinant is zero. 

Lemma 24.32. If Q is a matrix with 

1. all non-negative entries 

2. exactly one positive eigenvalue 

then Q is a limit of negative subdefinite matrices. 
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Question 136. Show that the Hurwitz transformation 1 115.9. 1) maps P^ os to it- 
self. 

x i] ■ • ■ x td 

Question 137. Show that the linear transformation xj 1 • • • x 1 ^ 1 t— > — : d — - 

[tl H i d )' 

maps P^ os to itself. (It does not map P 2 to itself.) 

Question 138. Suppose that fo,o = 1 and fy satisfies ([10.24.2l l where the con- 
stants satisfy by and 

Qo,o «S ao,j < 11,0 < ay sC a 2/0 a 2/j < a 3/0 • ■ ■ for j = 1, 2, . . . 

Then we have fy <C f t,o for all i, j . Consecutive f y do not necessarily interlace. 

Question 139. For which regions D c K 2 is it the case that for all f € ¥ s ^ p the 
integral below is in P? 

f(x,s,t) ds dt 

. D 

Question 140. For which regions D c K 2 is it the case that for all f,g g the 
integral below is in P? 

f(x, s)g(x,t) ds dt 

. D 

In this case, the conclusion holds if D is the product (0, 1) x (0, 1) or if D is the 
segment < s, t and s + t = 1 or if D is the triangle < s, t and s + t < 1. 

Question 141. Suppose that foo <.foi/fio- Show that fiofoi — fooW ^ if and 
only if there is an extension of foo, fiO/ foi such that W = fn. (See p. 13611 ) 

Question 142. If f(x,y) has the property that f (a,y) < g(a,y + 1) and 
f (x, a) < f (x + 1, a) for all real a does it follow that f e P sep ? 

The polynomial closure of Ph 

Question 143. We know that a+ bx+ cy + dxy is in P2 for any choice of a, b, c, d 
satisfying | £ a I < ^ or which choices of constants ai do we have 

Y_ Q i xI e ? d 

where the sum is over all 2 d monomials I? For instance, if d = 3 a determinant 
such as 

x + a di d 2 
di y + b d 3 
d 2 d 3 z + c 

has the desired form. Branden's criterion generally leads to infeasible prob- 
lems 

Question 144. Suppose that L is a lattice in Z d with basis vectors v = 
{vi, . . . ,v e }. If I = (ii, . . . ,i e ) is an index set then set I • v = ijvi + • • • + i e v e . 
When is it true that 
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If f ( x ) = £ aix 1 is in P^ 08 then £ a^x 1 e P p e ° s 

Notice that by taking Vi = (2) the Hurwitz theorem on even and odd parts 
(Theorem 17.641 1 is a consequence of this question. Empirically, the question 
holds for V! = (3,3) or (4,4) or (2,4), and fails for Vi = (3,2) or (2,1). The 
question fails if we replace P^ by P2. 

The diagonal (Theorem ll3.91 of a polynomial in P2 is found by taking vi = 
(1,1). For another example, take f = (x + y + z + l) n , vi = (1,1,0) and 
V2 = (0,0,1). The resulting polynomial is in P2, and the coefficient of x£ is 

Question 145. If f G Pa has degree 1 in each variable then is there a d by d 
matrix A and a diagonal matrix of the form (di.x0 such that det(A + D) = f? 
In particular, can xuz — x — y — z + lbeso represented? 

Question 146. Suppose f = £ ayx l u> e P2, and define f rs = Y. ^i+r^j+s* 1 !^ 
where r, s e {0, 1}. Show that 

foo + uf 10 + uf i + uvfn e P4 

Is this true in the special case that f is a product of linear factors? 
Question 147. Suppose that g e P2 0S (n) and write 

TL 

g(x,y) = Y_ 9iMv x - Define 

i=0 

n 

i=0 

then the absolute value of the argument of any root of f is at least . 

Note that if n = 1 then this states that go(x 2 ) + xgi(x 2 ) has negative real 
part for go < — gi - the classic Hurwitz stability result. 

Question 148. Suppose that fj, . . . , f n are mutually interlacing. Are there pos- 
itive Qi so that 

n 

£a t f t (x)y i eF 2 ? 

1=1 

A special case of this: suppose the roots of f are T\ ^ ■ ■ • ^ r n . Are there 
positive at so that 

Question 149. Let T(x n ) = P n , the Legendre polynomial. Show that if f e P2, 
then T*(f)(a,y) 6 P for |a| < 1. The case f = (-1 + xy) n is |l68j page 387, 
problem 69] and ( |14.3.1t . 
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Question 150. Recall the Hermite polynomials H T/S in Theorem 113.351 We 
know that we can represent H r s as the determinant of a matrix. Show that 
we have the following representation of H r i as the determinant of the tridi- 
agonal matrix 

j_^jT+l e ax 2 +2bx+C'y 2 _I e -(ax 2 +2bxy + cy 2 ) _ 

dx r 9y 

2(ax + by) Via ... ... 

Via 2(ax + by) \/ia ... ... 

Via 2(ax+by) V6a ... ... 



... ... v / 2(r-2)a 2(ax + by) V 2 ( r ~V a 

... ... V 1 ^ !) a 2(ax+by) V2rb 

... ... \/2rb 2(bx + cy) 

Question 151. For which symmetric matrices Q is xQy 1 e P2d? Setting x = y 
shows that xQx* e so Q is negative subdefinite. 

Question 152. Suppose that f,g,h,k€ P satisfy 
9 

1. Interlacing: f k 

h 

2. Determinant: | £ £ | < 
Show that f+yg+zh + yzke P3. 

The analytic closure of Pa 

Question 153. If L n is the Legendre polynomial then 1168. page 69] 

00 

f(x,y) = Y_ KMy n = (1 - 2xy +y 2 )- : / 2 

i=0 

If g € P' -00,11 ^) then show that the coefficients of y n , • • • ,y 2n in g(y)f(x,y) 
are in P. The coefficients of y 2 , . . . ,y n ~ 1 are not necessarily in P. 

Question 154. When do all the coefficients of y l in a product of terms of the 
form (a + y ) bx+c (d + y ) rx+s have all real roots? The coefficient of y 3 in (1 + 
y ) x (2 + y ) x is not in P . It seems that all coefficients are inPfor(l+y) x (2— y) x . 

Question 155. Show that the linear transformation x 1 1— > H t where Hi is the 
d-dimensional Hermite polynomial, maps Pa to itself. 
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Question 156. Suppose that {p n } is an orthogonal polynomial system. Can we 
find constants at so that 

oo 

^QiPiMy 1 G P 2 

i=0 



Extending 

Question 157. Some questions about < p 2 - 

1. Is ^2 closed under multiplication? 

2. If f G P pos then is f(-xy) G ^2? 

3. Is e~ xv G *p 2 ? This would follow from the above question. 

4. Does Lemma [TZ4l hold for <£ 2 - 

Question 158. Consider the map x n 1— > L n (x;y). Show that it maps P — ► P^i. 
Question 159. Is \j^/x in the closure of 

Question 160. Which functions analytic in Re(z) > are uniform limits (in 
Re(z) > 0) of polynomials with all real roots? 

Question 161. If f G 8\i then show that f (x,y) determine an operator f (x, D) 
that maps P pos — > P 

Question 162. Is Qd, e closed under differentiation? Are coefficients of Qd, e 
well-behaved?? Are there interesting transformations that preserve Qd,e? 
Question 163. Suppose f, g G Qd,2 have the property that for all p < — q G Qa,2 
it holds that fp + gq G Qd,2- Then g < — f. 

Generating Functions 

Question 164. If ^fi(x) = [x) i then the generating function satisfies 



9 

3x " ' i! J y ~ Jl i! 



Integration with respect to x shows 



>fiM— = : 7— r + C y 

^— x! log(l+y) 



If we choose c(y ) = — 1/y and write 

^— i! log(l+y) y 
then show that gt(x) interlaces (x) i and hence is in P. This solves Question|2] 



d+y) 
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Question 165. Can we use the identity 

s- ■ (y). 

2jA l f)^ = f(x + y) 

to get information about linear combinations of A l f? 
Question 166. Is the following true? 

Suppose that ai...,a T and bi,...,b s are positive and satisfy 
[Y_ bi) — {Y_ Qi) > — 1- Then the Hadamard-type product 

r 

n(ain)! 

±=i x n ra = n 



otherwise 

maps P x p pos — > p. The assumptions on the a's and b's guar- 
antee that the genus is 0. The problem is to show that all the roots 
are real. 

Question 167. In Lemma [15.69| we saw that f (x, 5) had an eigenpolynomial p n 
of degree n for n sufficiently large. Show that there is an eigenpolynomial for 
all n, and that p n < p u -i- 

Question 168. Show that if G e F*2 interlaces e~ xy then G = (ax + by + c)e~ xy 
where a and b have the same sign. 

More generally if f e P2, determine all G such that G € P2 interlaces 

f(x,u)e-*«. 

Recurrences with polynomial coefficients 

Question 169. What are necessary and sufficient conditions for go(x), QdM 

so that there exists a sequence fo, f 1, . . . satisfying 

1. f t eP,i = 0,l,... 

2- Lx d = 9iW = 0forn = 0,l,.... 

Matrices 

Question 170. If f = fT(x + b^u + q.) = Y-ViMv 1 then can we explicitly 
construct a matrix A such that |A[1, . . . , i]| is a multiple of p^? 

Question 171. If A is a d by d matrix of non-negative constants then the se- 
quence 

ho = I 
h n+1 = xAh n - W n 

defines a sequence of d by d matrices whose entries are all in P. 
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Question 172. Suppose that A, A' are diagonal matrices with positive diagonal 
entries, and C, C are symmetric matrices. If xl + yA + C| = |xl + yA' + C'| 
then what is the relationship between A, A', and C, C? 

Question 173. Interlacing was defined for matrices in terms of their character- 
istic polynomials. Is there a notion of "interlacing" for general linear transfor- 
mations? For example, if S and T are compact self-adjoint linear operators on 
a Hilbert space, then it is clear what it means to say that the discrete spectrums 
of S and T interlace. However, what does it mean to say that the continuous 
spectrums interlace? 

Question 174. Suppose M = xI+yD+C where C is symmetric, and D = (dtj ) is 
a diagonal matrix with positive diagonal. If f (x,y) = |M|, and g(x,y ) = |M[1]| 
then define h = f — (x + dny)g. Show that he P2 and f <J h. 

Question 175. Suppose that Po = 0, Pi is the n-dimensional identity matrix, 
and P n +i =xP u - JPn-i where J is the n x n matrix of all l's. The diagonals 
of P n are all equal to a polynomial f n , and the off diagonals are all equal to 
g n . It isn't true that f n e P, but show that g n e P and g n +i < Qn- 

Matrix polynomials 

Question 176. Do all matrix polynomials in Hypj have a common interlacing? 
Question 177. Is Hyp 1 closed under Hadamard products? 
Question 178. If f € Hyp x has degree n then does the function 

x 1 ► maximum eigenvalue of f (x) 

have exactly n zeros? 

Question 179. Suppose that Di, . . . and Ei, . . . are positive definite matrices. 
Show that the determinant of 

(Di+xEi) * (D 2 +xE 2 ) * ••• * (D n + xE n ) 

is a stable polynomial, where * is the Hadamard product. 

More generally, if A, B are matrices with polynomial entries, and |A| and 
B| are stable polynomials then is it true that A * B is stable? 

Note that all the polynomials in question are matrix polynomials in Hypj . 

Polynomials non-vanishing in a region 

Question 180. Determine all the exponential functions in NVi(C) where G is a 
cone in C. 

Upper half plane polynomials 

Question 181. If A is skew-symmetric and f(x) e u n then is f (Ax) e U n ? If 
n = 2 and A = ( ° t J ) then f ( Ax) = f (y, -x) which is in U 2 if f is. 
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Figure 1: The imaginary part of a polynomial in 32 



Question 182. Describe all the polynomials that interlace a given polynomial 
inUi(C). 

Question 183. If f < g in Ui (C), and all coefficients are positive then show that 
f • g' = h(x) + ik(x) where h and k are stable. 

Question 184. If (x — a) (x — t) € Ui (C) then show that (x — u) (x — t) < x — y if 
and only if -v lies in the ellipse whose foci are cr, T, and is tangent to the x axis. 

Question 185. If f e P pos then show that J* f(-ttx) dt e Ui(C). 
A consequence: 
Let h(x) = Jo f (-itx] dt. Then 



f (i - tx) dt 



f(-itx) dt 



f (-itx) dt = h(x) + h(x) = 2m{h) 



Since h G Ui (C), it follows that the integral from —1 to 1 is in P. 

Question 186. If f € P^, f = Y_f\(~x-)y l then all the polynomials 

fiM f i+x (x) 



ft_i(x) f t x) 



are stable. In particular, all their coefficients are posi- 



tive. Thus, the sequence of coefficients of f are q-log-concave. (This is not true 
if f G P2.) More generally, show that the d + 1 by d + 1 determinant of the 
matrix given below is stable. 



fi 

fi+i 



fi+i 

fi+2 



fi+d 
fi+d+1 



i+d 



fi 



+ d+l 



f 



2d 



Question 187. Although we can't graph a polynomial in U2(C) since it has 
complex roots, we can look at the plots of the real and imaginary parts. Since 
all imaginary parts are negative, the graph of the imaginary parts lies in the 
lower half plane; the graph of the real parts has no such restriction. The graph 
of the real part looks like the graph of a polynomial in P2, while the graph of 
the imaginary part looks very different (Figured). Explain this behavior. 
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Stable polynomials 

Question 188. Suppose that f = X ^i( x )v x is m P;j 0S (""•)■ Define the polynomial 

n-2d 

Pd(x,y) = ^ y k det ( f ^+i+i)os;y<d 

k=0 

Show that 

1. pd(x, a) is a stable polynomial for all positive a. 

2. All coefficients (the determinants) are stable polynomials. 

Question 189. If f (x,y ) e then all minors of the matrix d21.10.2t are stable. 

Question 190. If f, g e then all entries of the Bezout matrix B(f, g) are 
stable polynomials in y . 

B(f,g) 

u — V 

Question 191. If f = £ f t (x,y Jz 1 e P^° s and 

/fo(x,yo) ■•■ f o(x,yd)" 

V f d(x,y ) ■•■ fd(x,yd)y 

then the coefficient of any monomial y^ ■ ■ ■ y e ^ in Mdl is stable. If d = 1 this 
is the Bezout matrix. In addition, the determinant of Md is stable. 

Question 192. A special case of the previous question. If Y_ ft My 1 G P^ then 
the polynomial below has all positive coefficients. Moreover, all coefficients 
ofxV areinP pos . 



f (v,y)g(u,y) - f(u,y)g(v,y) 



1 



(x-y)(x-z)(y -z) 



f (x) fi(x) f 2 (x) 

Uv) fi(y) Mv) 

f (z) fi(z) f 2 (z) 



Question 193. If gi, . . . , g n G P pos M and mi, ■ ■ • , m n are positive definite ma- 
trices then 

|giTTli H h gnTTL n | 6 5£ 

Question 194. Is the determinant of a stable matrix polynomial a stable poly- 
nomial? 

Question 195. If f = £ ftMy 1 G P^° s then is 



fi _ fo 

h fi 



Qi 



Qi 



Question 196. Suppose f = T\{x + r t) G pP ° s - what polynomials in P pos 
interlace f in JCj ? 
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Question 197. If f < — g then when are there positive definite Di, D2, skew- 
symmetric A, and positive a, |3 such that 

|l + xDi +yD 2 + A\ = af+|3guH 

Question 198. Suppose that f = Y. ft My 1 is m P2° S ( n k Define the polynomial 



n-2d 



Pd(x,-y) = Y_ y k det ( f k+i+j) <i, 



k=0 



Show that 

1. pd(x, a) is a stable polynomial for all positive a. 

2. All coefficients (the determinants) are stable polynomials. 
Question 199. Define the infinite matrix 



M = 



(\ +2 ) 



V : 

where the periods (.) mean zero. If 



exp(M) 



( ■ ■ ■ 

d0,2 a l,2 Q2.2 

V : 



then show that the rows Y. Q i,j x ' are stable. 

If k = 2 and f n is the polynomial corresponding to the nth row then the 
recurrence 

fn = (X+ n- l)f n -l - 



2 \2 



n-2 



shows that f n € P. Since all entries of the exponential are positive we have 
thatf n e P pos C Jfi. 

p 

Question 200. If f € JC and f (x) > for all real vectors x then are there g < — h 
so that f = g 2 + h 2 . 



Transformations in the complex plane 

Question 201. Show that the containment in Proposition 122.231 is an equality. 
That is, if T(x n ) = L n (x) and z| ^ 1 then all roots of T(x — z) n have non- 
negative real part. 
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Question 202. If T: x n i-> L n (x) then show that A(TP A ) is the closed right half 
plane. Equivalently, show that if Tf has a root with negative real part, then f 
has a root with absolute value greater than 1. 

Question 203. Suppose f p is defined as in J22.2.1|l . Is there a Sendov type result 
for the roots of fp? That is, if all the roots of f are in the unit disk, then how 
far away from a root of f are the roots of f p ? 

Question 204. What linear transformations have the property that they map 
polynomials with all roots on a horizontal line in the complex plane to poly- 
nomials with all roots on a horizontal line? The transformation x n i— > H n is 
such a transformation. 

Question 205. If S is a region in C and T is a linear transformation, then when 
is it the case that 

1. A(TP S ) =A(T(4>S))7 

2. A(TP S ) c Hull( A(T(4)S)) )? 

Question 206. If A is the unit disk in the complex plane, then what are exam- 
ples, other than dilations or powers of the derivative, of linear transformations 
that map P A to itself? 

Question 207. If T(x n ) = p n (x.) where p n is the Legendre or Chebyshev poly- 
nomial, then is it the case that 

T(P A ) C P A ? 

Question 208. What are examples, other than the derivative, of linear transfor- 
mations that map P A to itself? 

Question 209. Let T(x n ) = L n (x). If A is the unit disk and R the right half 
plane, then 

T(P A ) C P R 

That is, if all roots of f have magnitude at most 1, then the roots of Tf have 
non-negative real part. Is it also the case that every point in the right half 
plane is the root of Tf, for some f € P A ? 



Notes 



Polynomials 

Interlacing has several synonyms in the literature. Gantmacher [65J uses the 
term positive pair for interlacing polynomials with all negative roots. In [63 [ a 
pair of polynomials f , g with all real roots satisfying f < g or f 3> g is called a 
real pair. If they satisfy f 3> g and all roots are negative, they are called a positive 
pair. Wagner |l78j says that f alternates left of g if f 3> g. Bilodeau [14] says that 
q separates p if p < q. In [73] two polynomials with a common interlacing are 
called complementary. In 11581 and 11011 a sequence f < g < h is called a Sturm 
sequence of polynomials. [35] says that two polynomials interlace if the roots se 
seperent (separate themselves). 

Apparently Markov was the first to prove that differentiation preserves 
interlacing. See 11411 . 

Lemma lLTTl is in 11241 . 

Quantitative sign interlacing occurs in several different places. It is im- 
plicit in 11651 page 226]. The case f 3> g is found in [66|. Quantitative sign in- 
terlacing is used in [ 68 1 to prove that the characteristic polynomial of a graph 
is interlaced by the characteristic polynomial of a one point deletion. Exercise 
7 on page 146 of [68| is an explicit statement of quantitative sign interlacing 
for f < g. Quantitative sign interlacing can also be interpreted as a property of 
rational functions. See [155. page 320]. 

[179 1 contains many of the sign-interlacing results of § 1111.21 
Corollary 11.561 is well known; it occurs for example as an exercise in (63l 
page 269]. 

Johnson [95] shows the convexity of the set of polynomials that interlace a 
given polynomial using sign interlacing. 

The basis in Section [L3l is sometimes called the "interpolation basis" [63], 
since they are, up to a multiple, the polynomials in Lagrange interpolation. 
Fuhrman [63] calls the basis fi(x) = Jl^ 1 the spectral basis. 

See |141l page 198] for a different proof of Proposition ll .351 [73] established 
Proposition 11.351 in the case of positive combinations for the special class of 
Hurwitz polynomials. These are polynomials f(x 2 ) where f S P alt has no 
repeated roots. 

The identity [32, page 24] > where the Pi are orthogonal fol- 

lows from Lemma [l.55l using the fact that p n +i < Pn- 

A consequence of Lemma [l.59l is that the associate polynomials of a family 
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of orthogonal polynomials interlace. See [32, page 86]. Lemma fl .591 is often 
used implicitly - see for example 11191 Lemma 3.1] 

1 8] proved Lemma [l.55l in the more general case of differentiable functions. 

Corollary H3l]is in [35]. 

[56 [ has many results about polynomials, with no references. Some of them 
are from [138 1 . This is translation of the book that first appeared in 1954. Here 

are a few of the items in the book: 

problem 727 interlacing implies all linear combinations in P 

problem 728 all linear combinations implies interlacing 

problem 729 f, g interlace implies derivatives interlace. 

problem 730 yf + (A + x)f ' e P if y > or y < -n 

problem 742 if f — a e P and f — b e P, then f-yePforas^y^b 

[131] has a proof of an elementary interlacing property. 

Jahnke [93 . page 153] observes that af + bf ' and cf + df ' interlace if ad — be 
is not zero, where f is a Bessel function. 

Lemma fl. 751 is a version of Kharitonov's theorem [13] for polynomials in 

ppos 

See [87] or [136] for inequalities similar to Example ll.691 
Corollary ll.83l is in HT7J, along with many nice applications. 

Matrices preserving interlacing 

In 1 70] they use the term interlacing negative zeros for a set of mutually interlac- 
ing polynomials (with negative zeros). 

The proof of Proposition [3]22] is based on the argument in Holtz[81 ]. 

Homogeneous 

We could also derive Newton's inequalities from Corollary 14.31 1 using f and 
f. 

Analytic 

Theorem [536] is in [14] . The class P is also known as the Laguerre-Polya class 
£ — 7 [39]. The polynomials T(l + x) n are called Jensen polynomials. A poly- 
nomial in P pos is called type I. 

In 1 45 1 they essentially prove Kurtz' result (Theorem 15. 19b , but with a co- 
efficient of 4.07 instead of 4. 

Linear Transformations 

In [37] they discuss various questions concerning linear transformations, in 
particular what they call linear operators operating on entire functions. For 
instance, they ask for a classification of all linear transformations that take 
polynomials with zeros in a set S (which might be complex) to polynomials 
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with all zero in a set T. In l36l Corollary 2.4 they determine all linear transfor- 
mations T for which f and Tf interlace, but just for transformations that are 
multipliers. The answer is not expressed in terms of derivatives, but rather 
that the multiplier sequence is a nonconstant arithmetic sequence all of whose 
terms have the same sign. 

1 124] proves a special case of Lemma [7j87l 

1 24 J proved the identity of Example 16.461 by using properties of the dia- 
mond product. 

Examples of Linear Transformations 

Szeg6[168] is the source for all the facts about orthogonal polynomials. 

Iserles and Norsett [90] call the inverse of the falling factorial the Laguerre 
transformation. 

In [51 J they study the interlacing properties of the polynomials sequences 
determined by recurrences p n +i = 2r n xp n — (1 — x 2 )p^ for interlacing se- 
quences (r n ) of coefficients. 

See [22] for an identity involving Hadamard products and Laguerre poly- 
nomials that is similar to l|7.10.61 . 

Some results involving linear transformations of polynomials have been 
expressed in umbral notation, which, to my mind, is just confusing. For ex- 
ample, Rota II148II defines modified Hermite polynomials H n (x) by 

If we define T(x n ) = H n (x) and M(x) = \/2x then we have an elegant identity 
T 2 = M. _1 TM. However, this is written in 111481 as 

Hn(H) = 2 n/2 H n (-^=) 

[10[ and [153[ discuss asymptotic properties of the roots of eigenpolyno- 
mials of arbitrary differential operators of the form d7-4.lt . 

Affine Transformations 

Johnson Il95l proves Lemma IS. 26l by a messy algebraic analysis. The idea for 
the q-Laguerre polynomials comes from |156|, who considers various prop- 
erties of more general polynomials. There is no discussion about the roots, 
however. 

The affine derivative was called L( q b ) in [77] . 
Complex Transformations 

There is an analytic proof [91] that the linear transformation x n — > T n maps 
P 1 " 1 ' 1 ' to itself. 
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Wagner 11751 refers to a folklore theorem that is Corollary 122.481 with a = 

tt/3. 

Polynomials in two variables 

Lemma [9]26] is a standard result about asymptotes. See [132] Lemma fe.lSOI can 
be found in 11341 

Polynomials in several variables 
Matrices 

The calculation in Section ll7.2l is from [85. page 187]. 

Smith [88, 160 J extends results on interlacing of Schur complements to sin- 
gular Hermitian matrices using the Moore-Penrose inverse. 

Stable 

Lemma l21.38l is in [186, page 33], but the proof is more involved. 11181 showed 
that the Bell polynomials are q -log-convex. 
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